-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DATA] Remove repeated chars during preprocessing #7739
[DATA] Remove repeated chars during preprocessing #7739
Conversation
Thanks for your contribution! |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## develop #7739 +/- ##
========================================
Coverage 56.56% 56.56%
========================================
Files 589 589
Lines 89964 89964
========================================
Hits 50889 50889
Misses 39075 39075 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
1ec3fea
to
93ac4fa
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR types
Function optimization
PR changes
Others
Description
Remove repeated chars during preprocessing.
remove_repeated_chars()
to remove the repeated chars.max_repeated_len
to control the max length.max_repeated_len
in README.md.