[WDL Biscayne] Move escape evaluation into Cromwell (and make it work) #4427

cjllanwarne · 2018-11-27T16:27:32Z

NB will require approval of openWDL PR [Hermes Grammar] Implement string escapes for #247 openwdl/wdl#272
On merge, notify openWDL that Specify UTF-8 encoding and tidy up string definition openwdl/wdl#247 can be merged

Problems with the old method:

Quotes were not being escaped
- eg this would fail: String s = "a\"b"
Hex codes were not being interpreted by wdl_unescape:
- eg this would fail: String hex_hello = "\x68\x65\x6C\x6c\x6F"
Double length unicode codepoints were not being parsed:
- eg this would fail: String unicode_hello = "\u0068\U00000065\u006C\U0000006C\u006F"

New method:

Parse escape sequences in hermes grammar
Allow Cromwell to interpret the escape sequences as it needs to

The main benefit to this was not having to mess with the wdl_unescape method in hermes to get the unrespected escape types to work.

aednichols · 2018-11-27T17:14:22Z

wdl/model/draft3/src/main/scala/wdl/model/draft3/elements/StringEscapeSequence.scala

+  val FourDigitUnicode = "\\\\u([0-9a-fA-F]{4})".r
+  val EightDigitUnicode = "\\\\U([0-9a-fA-F]{8})".r
+
+  def parseEscapeSequence(seq: String): ErrorOr[StringEscapeSequence] = seq match {


We test a wider range of escape sequences in string_escaping.wdl - will these still work?

Of course, we are allowed to change the behavior, with consensus - I doubt they see much use.

I'm not touching the WDL 1.0 parser in this change, which means that any WDL 1.0 string literal runs through the hermes-encapsulated wdl_unescape function and becomes a plain old :string.

This code will only run over :escape tokens.

And yes, it's deliberate that the set is smaller in Biscayne - per openwdl/wdl#247

To be clear I referred to the 1.0 file because it's just a convenient list to point at

geoffjentry · 2018-11-27T21:47:22Z

@cjllanwarne Please remind me when this is merged as IMO this, plus your new openwdl PR fully satisfy the implementation requirement for the underlying wdl spec PR

aednichols

👍 LGTM

kshakir · 2018-12-13T16:24:39Z

womtool/src/test/scala/womtool/WomtoolValidateSpec.scala

@@ -13,7 +13,6 @@ class WomtoolValidateSpec extends FlatSpec with Matchers {

  behavior of "womtool validate"

-


Unless you have strong opinions on this line, I'd revert this to keep the file from showing up in the change set.

Because IMO two-line-breaks are worthy of shrinking (but mainly to avoid having to re-run this PR through Travis) I'll leave this in unless you strongly object?

kshakir · 2018-12-13T16:32:39Z

wdl/model/draft3/src/main/scala/wdl/model/draft3/elements/ExpressionElement.scala

+  sealed trait StringEscapeSequence extends StringPiece {
+    def unescape: String
+  }
+  case object NewlineEscape extends StringEscapeSequence { override val unescape: String = System.lineSeparator }


I would either change parseEscapeSequence() to also use something like case System.lineSeparator, or change this to be override val unescape = "\\n".

Otherwise on some systems the values may not match. Even if we don't support those systems, I still prefer consistency.

They're actually subtly different - parseEscapeSequence is looking at what escape sequence was in the WDL (eg if my script has String s = "\n") - this line is specifying what value to replace the escape sequence with in the resulting scala String value

cjllanwarne added 5 commits November 21, 2018 14:48

Parse and read character escape sequences

c039104

Evaluators and tests for escaped literals

36db836

Evaluators and tests for escaped code points

56148a4

Evaluators and tests for escaped code points

ca062c6

Simplify structure

7120ef6

cjllanwarne added the 🌲Redwood label Nov 27, 2018

Add tests for single quote escapes too

589f7b3

cjllanwarne mentioned this pull request Nov 27, 2018

[Hermes Grammar] Implement string escapes for #247 openwdl/wdl#272

Merged

aednichols reviewed Nov 27, 2018

View reviewed changes

cjllanwarne added 2 commits November 27, 2018 12:51

Fix evaluation of static strings, test UTF8 encoded files

025426e

Complete match statements to allow compilation

3b7f04d

rebrown1395 assigned cjllanwarne Dec 4, 2018

aednichols approved these changes Dec 11, 2018

View reviewed changes

rebrown1395 added the 👍Red Thumb Required 🙏 label Dec 12, 2018

rebrown1395 requested a review from kshakir December 13, 2018 15:22

kshakir approved these changes Dec 13, 2018

View reviewed changes

cjllanwarne merged commit be1ee39 into develop Dec 13, 2018

cjllanwarne deleted the cjl_biscayne_escapes branch December 13, 2018 22:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WDL Biscayne] Move escape evaluation into Cromwell (and make it work) #4427

[WDL Biscayne] Move escape evaluation into Cromwell (and make it work) #4427

cjllanwarne commented Nov 27, 2018 •

edited

Loading

aednichols Nov 27, 2018

cjllanwarne Nov 27, 2018 •

edited

Loading

aednichols Nov 27, 2018

geoffjentry commented Nov 27, 2018

aednichols left a comment

kshakir Dec 13, 2018

cjllanwarne Dec 13, 2018

kshakir Dec 13, 2018

cjllanwarne Dec 13, 2018

		@@ -13,7 +13,6 @@ class WomtoolValidateSpec extends FlatSpec with Matchers {

		behavior of "womtool validate"

[WDL Biscayne] Move escape evaluation into Cromwell (and make it work) #4427

[WDL Biscayne] Move escape evaluation into Cromwell (and make it work) #4427

Conversation

cjllanwarne commented Nov 27, 2018 • edited Loading

aednichols Nov 27, 2018

Choose a reason for hiding this comment

cjllanwarne Nov 27, 2018 • edited Loading

Choose a reason for hiding this comment

aednichols Nov 27, 2018

Choose a reason for hiding this comment

geoffjentry commented Nov 27, 2018

aednichols left a comment

Choose a reason for hiding this comment

kshakir Dec 13, 2018

Choose a reason for hiding this comment

cjllanwarne Dec 13, 2018

Choose a reason for hiding this comment

kshakir Dec 13, 2018

Choose a reason for hiding this comment

cjllanwarne Dec 13, 2018

Choose a reason for hiding this comment

cjllanwarne commented Nov 27, 2018 •

edited

Loading

cjllanwarne Nov 27, 2018 •

edited

Loading