Avoid crash in simplifyRanges by removing subsets up front #6459

smikula · 2024-08-15T17:56:57Z

What's the problem this PR addresses?

Resolves #6373. The problem is that simplifyRanges doesn't reduce redundant OR ranges. For example, ~1.0.1 || ~1.0.2 should be simplified to ~1.0.1, because every version that satisfies ~1.0.2 also satisfies ~1.0.1. As the algorithm runs, it effectively calculates every combination of terms across such ranges. For example, given two ranges like ~1.0.1 || ~1.0.2, the nextAlternatives array will end up with 2*2 = 4 entries; with 100 such ranges it would end up with 2^100 entries. With exponential growth like this, it's not hard to crash the process.
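To make the growth concrete, here is a minimal sketch that only counts the combinations described above. It is not the simplifyRanges code: countAlternatives is a hypothetical helper, and it assumes each OR-term of each range is combined with every term of the other ranges.

```typescript
// Hypothetical helper, not part of yarn: counts how many combinations arise
// when every OR-term of each range is paired with every term of the others.
function countAlternatives(ranges: Array<string>): number {
  return ranges
    .map(range => range.split(`||`).length)
    .reduce((total, terms) => total * terms, 1);
}

const redundantRange = `~1.0.1 || ~1.0.2`;

// Two redundant ranges: 2 * 2 combinations.
console.log(countAlternatives([redundantRange, redundantRange])); // 4

// Twenty redundant ranges: 2^20 combinations.
console.log(countAlternatives(Array(20).fill(redundantRange))); // 1048576
```

If each range is first reduced to its single non-redundant term, the same count stays at 1 regardless of how many ranges there are.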

Arguably packages should not specify peer deps with this sort of redundant range, but sometimes they do. (I'm working on cleaning up my project now that I know what the problem is!) Regardless, yarn shouldn't crash when it happens.

How did you fix it?

At the beginning of simplifyRanges, I reduce any range of this sort by splitting it apart on || and using semver.subset to check whether one part of the range is a subset of another, in which case it can be excluded from the simplified range. I short-circuit if the range only has one term, to avoid any excess parsing.
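A rough sketch of that idea follows. It is not the code from this PR: removeSubsetsSketch and toySubset are hypothetical names, and toySubset is a deliberately tiny stand-in for semver.subset that only understands ~X.Y.Z terms, so that the example runs without dependencies.

```typescript
// Toy stand-in for semver.subset(sub, dom). Assumption: only `~X.Y.Z` terms
// are understood; anything else is treated as "not a subset".
function toySubset(sub: string, dom: string): boolean {
  const parse = (term: string) => {
    const match = /^~(\d+)\.(\d+)\.(\d+)$/.exec(term);
    return match ? match.slice(1).map(Number) : null;
  };
  const a = parse(sub);
  const b = parse(dom);
  if (a === null || b === null) return false;
  // ~X.Y.Z means >=X.Y.Z <X.(Y+1).0, so a higher patch gives a narrower range.
  return a[0] === b[0] && a[1] === b[1] && a[2] >= b[2];
}

function removeSubsetsSketch(range: string): string {
  // Short circuit: a range without `||` has a single term, nothing to drop.
  if (!range.includes(`||`)) return range;

  const terms = range.split(`||`).map(term => term.trim());
  const kept = terms.filter((term, index) =>
    !terms.some((other, otherIndex) => {
      if (otherIndex === index || !toySubset(term, other)) return false;
      // If two terms cover each other (equivalent), keep only the first one.
      return !toySubset(other, term) || otherIndex < index;
    }),
  );
  return kept.join(` || `);
}

console.log(removeSubsetsSketch(`~1.0.1 || ~1.0.2`)); // ~1.0.1
console.log(removeSubsetsSketch(`~1.0.1 || ~2.0.0`)); // ~1.0.1 || ~2.0.0
```

Because the function takes a single string and returns a string, it can be passed directly to ranges.map(...) ahead of the existing parsing step.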

I think this is the right fix, but I'm happy to take feedback or hand it off if someone knows better. (Maybe @arcanis as author of this code?)

Checklist

I have read the Contributing Guide.

I have set the packages that need to be released for my changes to be effective.

I will check that all automated PR checks pass before the PR gets reviewed.

arcanis · 2024-08-24T12:48:49Z

packages/yarnpkg-core/sources/semverUtils.ts

@@ -204,7 +204,7 @@ export function stringifyComparator(comparator: Comparator) {
 }

 export function simplifyRanges(ranges: Array<string>) {
-  const parsedRanges = ranges.map(range => validRange(range)!.set.map(comparators => comparators.map(comparator => getComparator(comparator))));
+  const parsedRanges = ranges.map(removeSubsets).map(range => validRange(range)!.set.map(comparators => comparators.map(comparator => getComparator(comparator))));


Can we rather do a filter on the set after we already parsed it, rather than the hardcoded || split followed by semver.subset calls (to avoid parsing the ranges twice)?

smikula · 2024-08-26

I considered this, but semver.subset only accepts a string or a Range object. If I have the parsed range, then I need to compare the members of Range.set that look like:

[
  [
    Comparator { options: {}, loose: false, operator: '>=', semver: [SemVer], value: '>=1.0.1' },
    Comparator { options: {}, loose: false, operator: '<', semver: [SemVer], value: '<1.1.0-0' }
  ],
  [
    Comparator { options: {}, loose: false, operator: '>=', semver: [SemVer], value: '>=1.0.2' },
    Comparator { options: {}, loose: false, operator: '<', semver: [SemVer], value: '<1.1.0-0' }
  ]
]

So then I need to either:

Implement my own version of subset (yuck), or...

Reconstruct a proper range out of the above without any more parsing, which really gets into the guts of semver

Either case seems like it would add a lot of complexity for marginal benefit. Given that I already short-circuit on ranges that don't have ||, I'm expecting the extra cost here is pretty minimal.

What do you think?

packages/yarnpkg-core/tests/semverUtils.test.ts

smikula · 2024-09-12T22:37:32Z

@arcanis Any further thoughts here?

arcanis · 2024-09-13T13:51:19Z

Looks good to me, thanks!

smikula and others added 4 commits August 15, 2024 10:11

Make simplifyRanges more efficient by removing subsets up front

e1a94ba

Fix lint errors

4ea5302

Run version check

8942826

Merge branch 'master' into remove-subsets

a430296

smikula changed the title ~~Make simplifyRanges more efficient by removing subsets up front~~ Avoid crash in simplifyRanges by removing subsets up front Aug 15, 2024

smikula marked this pull request as ready for review August 15, 2024 18:14

arcanis reviewed Aug 24, 2024


smikula and others added 3 commits August 26, 2024 11:43

Add a test case that would crash the process

423090d

Merge branch 'master' into remove-subsets

1595262

Merge branch 'master' into remove-subsets

877abb7

Merge branch 'master' into remove-subsets

0a909f7

arcanis merged commit 758a8be into yarnpkg:master Sep 13, 2024
25 of 26 checks passed
