Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vdk-impala: Fix parsing while analysing profile for lineage information #1206

Merged
merged 6 commits into from
Oct 5, 2022

Conversation

kostoww
Copy link
Contributor

@kostoww kostoww commented Oct 3, 2022

Why?

Checking for lineage information is gathered from the profile as seeking for WRITE TO HDFS and SCAN HDFS keywords, but currently it doesn't mark for lineage tables which are parsed from a view, ie if using (SELECT * FROM table t ) instead of (SELECT * FROM table) will not bring lineage info in front.

What?

Changed the regex pattern for scan and write

How has this been tested?

Unit and integration tests, also with production-grade queries

What type of change are you making?

Bug-fixing

Signed-off-by: Plamen Kostov [email protected]

…ift profile for lineage information

# Why?
Checking for lineage information is gathered from the profile as seeking for WRITE TO HDFS and SCAN HDFS keywords, but currently it doesn't mark for lineage tables which are parsed from a view, ie if using (SELECT * FROM table t ) instead of (SELECT * FROM table) will not bring lineage info in front.

# What?
Changed the regex pattern for scan and write

# How has this been tested?
Unit and integration tests, also with production-grade queries

# What type of change are you making?
Bug-fixing

Signed-off-by: Plamen Kostov <[email protected]>
@kostoww kostoww enabled auto-merge (squash) October 3, 2022 18:37
@kostoww kostoww disabled auto-merge October 4, 2022 07:51
@kostoww kostoww enabled auto-merge (rebase) October 4, 2022 07:52
Plamen Kostov and others added 3 commits October 4, 2022 11:02
…ift profile for lineage information

# Why?
Checking for lineage information is gathered from the profile as seeking for WRITE TO HDFS and SCAN HDFS keywords, but currently it doesn't mark for lineage tables which are parsed from a view, ie if using (SELECT * FROM table t ) instead of (SELECT * FROM table) will not bring lineage info in front.

# What?
Changed the regex pattern for scan and write

# How has this been tested?
Unit and integration tests, also with production-grade queries

# What type of change are you making?
Bug-fixing

Signed-off-by: Plamen Kostov <[email protected]>
…ift profile for lineage information

# Why?
Checking for lineage information is gathered from the profile as seeking for WRITE TO HDFS and SCAN HDFS keywords, but currently it doesn't mark for lineage tables which are parsed from a view, ie if using (SELECT * FROM table t ) instead of (SELECT * FROM table) will not bring lineage info in front.

# What?
Changed the regex pattern for scan and write

# How has this been tested?
Unit and integration tests, also with production-grade queries

# What type of change are you making?
Bug-fixing

Signed-off-by: Plamen Kostov <[email protected]>
@kostoww kostoww force-pushed the topic/kostoww/fix-vdk-impala-parsing-profile branch from e0bdd54 to bc87f3c Compare October 5, 2022 14:10
@kostoww kostoww merged commit a5831f4 into main Oct 5, 2022
@kostoww kostoww deleted the topic/kostoww/fix-vdk-impala-parsing-profile branch October 5, 2022 16:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants