Experiment implementing po_extract() #243

hadley · 2021-11-06T15:21:34Z

Mostly to trigger discussion. More comments below.

R/po_scan.R

And display number of messages found

R/po_scan.R

MichaelChirico · 2021-11-06T22:30:56Z

po_scan() doesn't feel like a great name to me... maybe po_snapshot()?

R/get_message_data.R

R/get_r_messages.R

MichaelChirico · 2021-11-06T22:43:07Z

R/get_r_messages.R

    # treat gettextf separately since it takes a named argument, and we ignore ...
-    get_named_arg_strings(expr_data, 'gettextf', c(fmt = 1L), recursive = TRUE),
+    get_named_arg_strings(expr_data, fmt_funs, c(fmt = 1L), recursive = TRUE),
+    if (use_tr) get_named_arg_strings(expr_data, 'tr_', c(string = 1L), recursive = TRUE),


an aside: tr_ might be a good opportunity to build something I'd been kind of stuck on:

Since R doesn't have parser-level string concatenation (i.e. C's 'abc' 'def') making translation-ready strings in R will tend to produce lintr-violating wide lines (esp. at 80 but also at 120 characters)... the tooling for base has no shot here because it considers all the inputs as individual strings.

But with custom functions we could allow:

tr_( 'abcd', 'efg' )

To wind up as abcdefg in the .pot file & also run gettext() on the combined string:

tr_ <- function(...) gettext(paste0(...))

I think it would require some effort to get it working but maybe worthwhile to prevent infinite scroll strings...

Yeah, I like that idea. I'll change it to a dots function.

In total we might want something like:

tr_ <- function(...) { structure(enc2utf8(gettext(paste0(...), domain = "R-pkgdown")), class = "translated") }

That ensures it's correctly flagged as UTF-8 (which gettext()) doesn't do by default, and flags it as a translated string which might be useful for other functions.

This function could also error if given anything other than a literal string (i.e. tr_(foo) and tr_(foo()) should error because they're not translatable).

that sounds basically right to me (requiring literal inputs), I do wonder if it will wind up being limiting. I have in mind the N_() construct in C which is needed to keep strings untranslated at compile time, then translated as a variable at run time.

https://github.com/wch/r-source/blob/430360aa394b02046dd7175c0fc55e347e947f80/src/main/errors.c#L1398-L1419

https://github.com/wch/r-source/blob/430360aa394b02046dd7175c0fc55e347e947f80/src/main/errors.c#L1461

Not sure if it's relevant, so probably it's OK to start being strict and allow the use cases to show themselves as a feature request.

hadley · 2021-11-07T13:53:27Z

Hmmm, it seems like we're cueing off different parts of this function — it both looks for (scans) and records (snapshots) messages for translation. I wonder if we can somehow get both ideas into one word ... How about po_extract()?

hadley · 2021-11-07T21:00:42Z

Fixes #223. Fixes #226. Fixes #229.

MichaelChirico · 2021-11-07T22:20:28Z

po_extract() sounds better to me -- both make clearer the output of the function... the scanning part feels more "incidental"

R/get_message_data.R

R/po_scan.R

R/translate_package.R

Conflicts: R/get_message_data.R man/get_message_data.Rd

hadley · 2021-11-08T13:06:37Z

Ok, I think this is pretty close to being done apart from the documentation — the problem is that po_extract() and get_message_data()share all of their arguments but there's no way to@inheritParams` from an .Rd file in the same package. I think there are two options:

Convert get_message_data() to use roxygen2
Duplicate the docs

Which would you prefer?

MichaelChirico · 2021-11-08T16:37:57Z

I'm happy to gradually converge on roxygen2 everywhere

hadley · 2021-11-08T18:59:08Z

@MichaelChirico if you want, I can do a separate PR that converts all the .Rds to roxygen (it's 95% automated). Do you want to use @export too or stick with the explicit NAMESPACE?

MichaelChirico · 2021-11-09T07:25:05Z

That works, esp. if it'll make it easier for you to be productive going forward (and assuming it's not a huge time drain for you).

I would keep an explicit NAMESPACE, I do prefer being more intentional there.

I was reading `style = 'base'` as the call which was throwing me off. an explicit argument here should help readability.

MichaelChirico · 2021-11-09T07:37:43Z

R/po_extract.R

+  po_params = list(
+    package = desc[['Package']],
+    version = desc[['Version']],
+    copyright = NULL,


is your sense that these fields (copyright and bugs) aren't really needed?

my sense as well has been these things are basically covered by the DESCRIPTION file already, and kept it to try & be consistent with base (and it's something of a maintenance headache to implement)

Yeah, I think we should remove them; it's better to leave to the DESCRIPTION and avoid creating duplicates that might get out of date.

hadley · 2021-11-09T11:50:44Z

To be clear, I meant using @export to generate the NAMESPACE, so it's still intentional, but you while you're near the source of the function you can tell if it's exported or not.

MichaelChirico · 2021-11-09T15:59:47Z

right -- I like that but the @import / @importFrom less so.

how about using the NAMESPACE roclet but keeping all the @import/@importFrom tags in a central place (e.g. the @doctype package or .onLoad)?

hadley · 2021-11-09T18:00:03Z

Sure. Our current convention is to put them with the package docs anyway.

hadley added 4 commits November 6, 2021 09:54

Extract po_scan()

6cb76f6

Inform selectively about R/src messages

a70b9bf

Optionally exclude condition functions

69bdd72

Build in tr_ support + add "explicit" style

06e1197

hadley commented Nov 6, 2021

View reviewed changes

R/po_scan.R Outdated Show resolved Hide resolved

hadley added 4 commits November 6, 2021 10:35

Also scan for messagef() etc

d2fe235

And display number of messages found

Delete accidentally committed .pot

05425a4

Add tr/tr_ to known translators

04aca15

Consistent function names

410f83c

hadley commented Nov 6, 2021

View reviewed changes

R/po_scan.R Outdated Show resolved Hide resolved

MichaelChirico reviewed Nov 6, 2021

View reviewed changes

R/get_message_data.R Show resolved Hide resolved

MichaelChirico reviewed Nov 6, 2021

View reviewed changes

R/get_r_messages.R Outdated Show resolved Hide resolved

MichaelChirico reviewed Nov 6, 2021

View reviewed changes

R/get_r_messages.R Outdated Show resolved Hide resolved

MichaelChirico reviewed Nov 6, 2021

View reviewed changes

hadley added 3 commits November 7, 2021 07:44

Treat tr_ as a dots function

da3ab28

Pull use_tr into own block

655700e

Make domain_fmt_funs consistent

ef21080

MichaelChirico reviewed Nov 7, 2021

View reviewed changes

R/get_message_data.R Outdated Show resolved Hide resolved

%chin%

c3bc2c9

MichaelChirico reviewed Nov 7, 2021

View reviewed changes

R/po_scan.R Outdated Show resolved Hide resolved

typo

e401af0

MichaelChirico reviewed Nov 7, 2021

View reviewed changes

R/translate_package.R Show resolved Hide resolved

hadley added 4 commits November 8, 2021 06:29

Rename to po_extract()

dc077fd

Push style argument down to get_r_messages()

c28af76

Start on docs

743cb45

Merge commit '583db6471e8a272a461006717855cee32ba82dd8'

dab11d7

Conflicts: R/get_message_data.R man/get_message_data.Rd

hadley added 4 commits November 8, 2021 06:54

Also need to update usage

750e805

Invisibly return the message data

ee6f642

Read style from DESCRIPTION if not set

b600b24

Update test

df938eb

hadley changed the title ~~Experiment implementing po_scan()~~ Experiment implementing po_extract() Nov 8, 2021

hadley added 3 commits November 8, 2021 12:56

Convert get_message_data.Rd to roxygen2

bbd4b43

Inherit params in po_extract()

e00ea69

WS

9e923e3

Merge branch 'master' into po_scan

eefbc6d

MichaelChirico added 5 commits November 8, 2021 23:27

ws

fbe527a

explicit argument for readability

0615c70

I was reading `style = 'base'` as the call which was throwing me off. an explicit argument here should help readability.

clarify docs

eb85e5f

update verbose default

2f56145

TODO comment for later

46cef27

MichaelChirico reviewed Nov 9, 2021

View reviewed changes

update .rd

ca58ca0

MichaelChirico merged commit bd9c91a into MichaelChirico:master Nov 9, 2021

hadley deleted the po_scan branch November 9, 2021 11:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Experiment implementing po_extract() #243

Experiment implementing po_extract() #243

hadley commented Nov 6, 2021

MichaelChirico commented Nov 6, 2021

MichaelChirico Nov 6, 2021

hadley Nov 7, 2021

hadley Nov 7, 2021 •

edited

Loading

MichaelChirico Nov 7, 2021

hadley commented Nov 7, 2021

hadley commented Nov 7, 2021 •

edited

Loading

MichaelChirico commented Nov 7, 2021

hadley commented Nov 8, 2021

MichaelChirico commented Nov 8, 2021

hadley commented Nov 8, 2021

MichaelChirico commented Nov 9, 2021

MichaelChirico Nov 9, 2021

hadley Nov 9, 2021

hadley commented Nov 9, 2021

MichaelChirico commented Nov 9, 2021

hadley commented Nov 9, 2021

Experiment implementing po_extract() #243

Experiment implementing po_extract() #243

Conversation

hadley commented Nov 6, 2021

MichaelChirico commented Nov 6, 2021

MichaelChirico Nov 6, 2021

Choose a reason for hiding this comment

hadley Nov 7, 2021

Choose a reason for hiding this comment

hadley Nov 7, 2021 • edited Loading

Choose a reason for hiding this comment

MichaelChirico Nov 7, 2021

Choose a reason for hiding this comment

hadley commented Nov 7, 2021

hadley commented Nov 7, 2021 • edited Loading

MichaelChirico commented Nov 7, 2021

hadley commented Nov 8, 2021

MichaelChirico commented Nov 8, 2021

hadley commented Nov 8, 2021

MichaelChirico commented Nov 9, 2021

MichaelChirico Nov 9, 2021

Choose a reason for hiding this comment

hadley Nov 9, 2021

Choose a reason for hiding this comment

hadley commented Nov 9, 2021

MichaelChirico commented Nov 9, 2021

hadley commented Nov 9, 2021

hadley Nov 7, 2021 •

edited

Loading

hadley commented Nov 7, 2021 •

edited

Loading