-
Notifications
You must be signed in to change notification settings - Fork 94
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
unicode #2979
Comments
suite names@oliver-sanders - suite names would be a good one to nail down right away. They are currently completely unrestricted, as you've noted. Motivation: @dwsutherland needs a delimiter for these IDs in the code: The minimal set of Unicode characters is probably alphanumeric plus dash, underscore, forward-slash:
Then @dwsutherland could use Is there any compelling reason to allow more than this in suite names? |
The only other characters that we may want to support are |
OK, I've quickly run some characters from different unicode tables against Python The regex:
Changes we might want to make:
Conclusions:
I think we are good to go with this Regex? Matching Charsbasic latinabc explictly supported special chars_.+@- latin supplementàðØ latin extendedĐĵŌ IPA extensionsɐɶʍ greek and copticΘχϢ anchient greek numbers𐅅𐅉𐅌 cryllicФШѸ armenianԱՔփ Non-Matching Charsemoji☺ special chars꘍ / < |
How about |
The colon |
Exactly! |
See also #3117 where I investigated & documented task & (to some extent) suite names that work normally, which is now in the docs here & here. I suggest we have the same limitations on suite name as on task name, as documented above, as largely in agreement with the comments above. Note there is also a maximum length that suite names can safely be, given the OS restriction on file names length. It might sound a bit over the top to point this out, but I have seen some exceedingly long suite names across the MO. Can we validate on length too for extra care? |
We can check for number of characters easily enough (can even do it in a regex), unfortunately most file-system file-name limits are in bytes. https://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits We could just slap on a 255 character limit anyway. |
See open questions on #3274 |
Could we use |
Note - The other reason why I was originally vying for |
I suppose we could (but it looks annoyingly like pipe or OR to me). I think we can go with the possibly-temporary solution and make a final decision once allowed suite-name chars are defined (within a day or so, I expect). |
All items addressed, closing. |
Follow on from #2966 to cover unicode support in Cylc.
What unicode should Cylc support and where?
exit
" #2281)The text was updated successfully, but these errors were encountered: