-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Specify what constitutes white-space characters #69
Comments
Actually, the standard does supply a list of whitespace characters in [lex.pptoken]p2:
and again in [lex.token]p1:
|
Note that 'new-line' there is already a term of art. It possibly includes
various combinations of
- U+000A: LINE FEED (LF)
- U+000D: CARRIAGE RETURN (CR)
…On Tue, Mar 23, 2021 at 4:14 PM Tom Honermann ***@***.***> wrote:
Actually, the standard does supply a list of whitespace characters in
[lex.pptoken]p2 <http://eel.is/c++draft/lex#pptoken-2>:
... Preprocessing tokens can be separated by whitespace; this consists of
comments ([lex.comment]), or whitespace characters (space, horizontal tab,
new-line, vertical tab, and form-feed), or both. ...
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#69 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAVNZ5UUC4SBXUC5D4SFHEDTFDZCFANCNFSM4ZV4FG4Q>
.
|
Later revisions of P2295 no longer address this. |
P2348 - of which an early draft is there https://isocpp.org/files/papers/D2348R0.pdf rewords the handling of whitspaces and new lines without extending the set |
This issue was discussed on the Unicode.org mailing list. There was a recommendation from a Unicode expert that, for programming languages, |
The total feedback was a single response, though. |
The C++ standard defines behavior that depends on whether a character constitutes white-space, but never defines what those characters are. Uses of the "whitespace" and "white-space" terms appear in:
P2178 proposal 2 sought to clarify the set of characters that constitute white-space and proposed the following set. These characters all satisfy the immutable
Pattern_White_Space
property (see UAX #44 and/or search forPattern_White_Space
in the UCD).The above set of characters excludes the following characters that satisfy the (not immutable)
White_Space
property (see UAX #44 and/or search forWhite_Space
in the UCD).When addressing this issue, we may want to take the opportunity to replace the existing "whitespace" and "white-space" terminology with "blank space"; ISO guidance may require such a renaming in the future.
The text was updated successfully, but these errors were encountered: