-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: better tokenization #266
Conversation
This reverts commit 89361d1.
Wait, don't revert the whole thing, you just want to remove the case-switch tokenization. |
-separator: '[\s\-,:!?=\[\]()<>{}"/\\]+|(?!\b)(?=[A-Z][a-z])|\.(?!\d)|&[lg]t;'
+separator: '[\s\-,:!?=\[\]()<>{}"/\\]+|\.(?!\d)|&[lg]t;' |
The part this was not possible before either, so removing just this small part will restore what you were used to in addition keeping the majority of the fix for #264 |
@HonkingGoose @viceice wdyt? |
I'll defer to people that actually know regex, I just copy pasted the regex from @TWiStErRob 😄 If you know the exact term, you should be able to find it with the search. And you should also be able to find partial matches. |
Co-authored-by: Róbert Papp <[email protected]>
fix search