-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade to Flink 1.15 & FLIP-27 Source APIs #37
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've checked out this PR and build it locally, together with a locally build Flink 1.15. I've tried some of the SQL recipes but I'm getting some weird results on timestamps.
CREATE TABLE orders (
bidtime TIMESTAMP(3),
price DOUBLE,
item STRING,
supplier STRING,
WATERMARK FOR bidtime AS bidtime - INTERVAL '5' SECONDS
) WITH (
'connector' = 'faker',
'fields.bidtime.expression' = '#{date.past ''30'',''SECONDS''}',
'fields.price.expression' = '#{Number.randomDouble ''2'',''1'',''150''}',
'fields.item.expression' = '#{Commerce.productName}',
'fields.supplier.expression' = '#{regexify ''(Alice|Bob|Carol|Alex|Joe|James|Jane|Jack)''}',
'rows-per-second' = '100'
);
I'm getting the same result for every row:
One more question: since this version won't be compatible with Flink 1.14 and lower, will you release this with a major/minor version bump?
@MartijnVisser Thank you for testing.
|
@knaufk Yes, I was indeed testing with parallelism of 1. It also occurs when parallelism is set to 2 (and probably higher) |
@knaufk yes, I'm very sorry for this, but it seems a regression slipped in due to some aggressive caching. Going back to version 1.1.0 should fix this, and it's also fixed in 1.3.0, but that one hasn't been release yet (it's planned to be released in 7 days). My apologies for this! |
just for the history, the problem with the same data for every row was fixed at datafaker-net/datafaker#88 |
Thanks everyone. I've added a test to guard against this kind of regression in the future and downgraded to 1.1.0 for now. |
d5d760b
to
9e37222
Compare
I'm having another look. I do find it weird that after pulling your latest commits, I'm now getting a Flink related error. That's probably for the Flink community to look into because I don't expect that they should appear when running a
|
@MartijnVisser I suspect this somehow related to your local setup. Neither the CI nor me locally can reproduce this. |
@knaufk Yes, I needed to remove locally downloaded Maven artifacts |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the PR. I've confirmed that it works with Flink 1.15 RC0
@knaufk @MartijnVisser Datafaker 1.3.0 has been released, this fixes the issue related to the cached data generation. |
No description provided.