[release/5.0] Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #45062

github-actions · 2020-11-21T23:07:17Z

Backport of #44688 and #44695 to release/5.0

Customer Impact

Dictionary does not work with ignore-case ordinal comparer for certain non-English languages. Customer reported regression from 3.1.

Testing

Targeted test added

Risk

Medium

Dotnet-GitSync-Bot · 2020-11-21T23:07:20Z

I couldn't figure out the best area label to add to this PR. If you have write-permissions please help me learn by adding exactly one area label.

jkotas · 2020-11-21T23:16:01Z

cc @danmosemsft

danmoseley · 2020-11-23T22:16:28Z

src/libraries/System.Collections/tests/Generic/Dictionary/Dictionary.Tests.cs

+        public void DictionaryOrdinalIgnoreCaseCyrillicKeys()
+        {
+            const string Lower = "абвгдеёжзийклмнопрстуфхцчшщьыъэюя";
+            const string Higher = "АБВГДЕЁЖЗИЙКЛМНОПРСТУФХЦЧШЩЬЫЪЭЮЯ";


Nit, coding-style.md

When including non-ASCII characters in the source code use Unicode escape sequences (\uXXXX) instead of literal characters. Literal non-ASCII characters occasionally get garbled by a tool or editor.

No need to fix it in this PR though.

I fully appreciate where this guideline came from, but the trouble with it - it makes the code significantly harder to read and maintain (pertaining to this specific example).

Yeah. IMO when our unit tests use strings in non-English languages, we should just write the literal strings unescaped in their original language. If our unit tests are instead testing "I put a non-ASCII character here, let's see what happens!", then it's good to put the "\uXXXX" explicitly, since it draws attention to the fact that there's something unique about the character at this index in the string.

I agree it's harder to read -- the concern is if garbelizing could somehow allow the test to continue to pass, without anyone notice.

Another option is to follow it with a Debug.Assert that compares them to their escaped forms. That way you maintain the readability, but garbelizing will immediately fail the test.

I don't feel strongly - if you think it is impossible that we would not notice.

danmoseley · 2020-11-24T01:39:04Z

Failure was #41511 in one case, and an infrastructure issue in the other. Rerunning failed legs.

EgorBo and others added 19 commits November 21, 2020 23:07

Fix GetNonRandomizedHashCodeOrdinalIgnoreCase

a2ee0e6

Add a test

e2c54f5

correct (but slow) fix

3f814ef

clean up

dde5a94

Update String.Comparison.cs

5aa4605

Update String.Comparison.cs

d9f9199

Update String.Comparison.cs

3cf6806

Address feedback and fix test

c5d7dda

undo change in tests

6cc065a

Address feedback

da7e87b

Clean up

0f14588

Address feedback

0de8fef

Address feedback

84dd975

Minor optimizations: manual loop unswitching

8aace29

Update internal comparers & out-of-bounds regression tests

65a5834

fix bad merge

05b5a8d

Address Jan's feedback

5f61d43

don't pass hash1 and hash2

8b27e6d

Update Dictionary.Tests.cs

be0d9a4

jkotas requested a review from GrabYourPitchforks November 21, 2020 23:14

jkotas added the Servicing-consider Issue for next servicing release review label Nov 21, 2020

jkotas approved these changes Nov 21, 2020

View reviewed changes

jkotas added the area-System.Runtime label Nov 22, 2020

GrabYourPitchforks mentioned this pull request Nov 22, 2020

Dictionary sometimes uses Ordinal hash code calculation instead of OrdinalIgnoreCase #44695

Closed

This was referenced Nov 23, 2020

OSX deprovision jaredpar/runfo#41

Closed

OSX machines are de-provisioned during CI / PR runs leading to failures #34472

Closed

danmoseley reviewed Nov 23, 2020

View reviewed changes

GrabYourPitchforks approved these changes Nov 23, 2020

View reviewed changes

stephentoub approved these changes Nov 24, 2020

View reviewed changes

GrabYourPitchforks added Servicing-approved Approved for servicing release and removed Servicing-consider Issue for next servicing release review labels Nov 24, 2020

GrabYourPitchforks merged commit d9a4a64 into release/5.0 Nov 24, 2020

GrabYourPitchforks deleted the backport/pr-44688-to-release/5.0 branch November 24, 2020 05:55

steveisok mentioned this pull request Nov 25, 2020

ConnectAsync_CancellationRequestedAfterConnect_ThrowsOperationCanceledException #41511

Closed

ghost locked as resolved and limited conversation to collaborators Dec 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[release/5.0] Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #45062

[release/5.0] Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #45062

github-actions bot commented Nov 21, 2020 •

edited by GrabYourPitchforks

Loading

Dotnet-GitSync-Bot commented Nov 21, 2020

jkotas commented Nov 21, 2020

danmoseley Nov 23, 2020

RussKie Nov 23, 2020

GrabYourPitchforks Nov 23, 2020

danmoseley Nov 23, 2020

danmoseley commented Nov 24, 2020

[release/5.0] Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #45062

[release/5.0] Handle non-ASCII strings in GetNonRandomizedHashCodeOrdinalIgnoreCase #45062

Conversation

github-actions bot commented Nov 21, 2020 • edited by GrabYourPitchforks Loading

Customer Impact

Testing

Risk

Dotnet-GitSync-Bot commented Nov 21, 2020

jkotas commented Nov 21, 2020

danmoseley Nov 23, 2020

Choose a reason for hiding this comment

RussKie Nov 23, 2020

Choose a reason for hiding this comment

GrabYourPitchforks Nov 23, 2020

Choose a reason for hiding this comment

danmoseley Nov 23, 2020

Choose a reason for hiding this comment

danmoseley commented Nov 24, 2020

github-actions bot commented Nov 21, 2020 •

edited by GrabYourPitchforks

Loading