Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve performance of formatting a project #2863

Merged
merged 37 commits into from
Jul 1, 2024
Merged

Conversation

janmasrovira
Copy link
Collaborator

@janmasrovira janmasrovira commented Jun 27, 2024

Currently formatting a project is equivalent to running juvix format on each individual file. Hence, the performance is quadratic wrt the number of modules in the project. This pr fixes that and we now we only process each module once.

Benchmark (1236% faster 🚀)

Checking the standard library

hyperfine --warmup 1 'juvix format --check' 'juvix-main format --check'
Benchmark 1: juvix format --check
  Time (mean ± σ):     450.6 ms ±  33.7 ms    [User: 707.2 ms, System: 178.7 ms]
  Range (min … max):   396.0 ms … 497.0 ms    10 runs

Benchmark 2: juvix-main format --check
  Time (mean ± σ):      6.019 s ±  0.267 s    [User: 9.333 s, System: 1.512 s]
  Range (min … max):    5.598 s …  6.524 s    10 runs

Summary
  juvix format --check ran
   13.36 ± 1.16 times faster than juvix-main format --check

Other changes:

  1. The EntryPoint field entryPointModulePath is now optional.
  2. I've introduced a new type TopModulePathKey which is analogous to TopModulePath but wihout location information. It is used in hashmap keys where the location in the key is never used. This is useful as we can now get a TopModulePathKey from a Path Rel File.
  3. I've refactored the _formatInput field in FormatOptions so that it doesn't need to be a special case anymore.
  4. I've introduced a new effect Forcing that allows to individually force fields of a record type with a convenient syntax.
  5. I've refactored some of the constraints in scoping so that they only require Reader Package instead of Reader EntryPoint.
  6. I've introduced a new type family so that local modules are no longer required to have ModuleId from their type. Before, they were assigned one, but it was never used.

Future work:

  1. For project-wise formatting, the compilation is done in parallel, but the formatting is still done sequentially. That should be improved.

@janmasrovira janmasrovira self-assigned this Jun 27, 2024
@janmasrovira janmasrovira force-pushed the improve-format-project branch 2 times, most recently from 1c983f7 to faeb2c3 Compare June 28, 2024 14:11
@janmasrovira janmasrovira force-pushed the improve-format-project branch from 16349b0 to 79c688e Compare June 28, 2024 16:25
@janmasrovira janmasrovira requested a review from paulcadman July 1, 2024 10:02
@janmasrovira janmasrovira marked this pull request as ready for review July 1, 2024 10:03
@janmasrovira janmasrovira force-pushed the improve-format-project branch from 6b378cd to 62f8e2b Compare July 1, 2024 14:03
@janmasrovira janmasrovira force-pushed the improve-format-project branch from 62f8e2b to e3d8df2 Compare July 1, 2024 14:05
@janmasrovira janmasrovira merged commit 6fcc9f2 into main Jul 1, 2024
4 checks passed
@janmasrovira janmasrovira deleted the improve-format-project branch July 1, 2024 16:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants