-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Add an option to search input files recursively in ingestion job. The default is set to true to be backward compatible. #9265
Conversation
I don't think it is safe to add this flag and set the default option to false since current behavior is searching recursively by default. It is dangerous to change default behavior since we don't know what we will break. |
Updated the default searchRecursive option to be true. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thanks for the contribution @yaostartree , this looks great overall. I have some minor comments. please kindly take a look
...-common/src/main/java/org/apache/pinot/common/segment/generation/SegmentGenerationUtils.java
Outdated
Show resolved
Hide resolved
pinot-spi/src/test/java/org/apache/pinot/spi/ingestion/batch/IngestionJobLauncherTest.java
Outdated
Show resolved
Hide resolved
pinot-spi/src/test/java/org/apache/pinot/spi/ingestion/batch/IngestionJobLauncherTest.java
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good job extracting the common code into the util
Is the pinot-plugins.tar.gz
file required for this PR? I feel it is mistakenly included
...in/java/org/apache/pinot/plugin/ingestion/batch/hadoop/HadoopSegmentGenerationJobRunner.java
Show resolved
Hide resolved
pinot-spi/src/main/java/org/apache/pinot/spi/ingestion/batch/spec/SegmentGenerationJobSpec.java
Show resolved
Hide resolved
pinot-spi/src/main/java/org/apache/pinot/spi/ingestion/batch/spec/SegmentGenerationJobSpec.java
Show resolved
Hide resolved
...-common/src/main/java/org/apache/pinot/common/segment/generation/SegmentGenerationUtils.java
Outdated
Show resolved
Hide resolved
...-common/src/main/java/org/apache/pinot/common/segment/generation/SegmentGenerationUtils.java
Outdated
Show resolved
Hide resolved
...main/java/org/apache/pinot/plugin/ingestion/batch/spark/SparkSegmentGenerationJobRunner.java
Show resolved
Hide resolved
...ain/java/org/apache/pinot/plugin/ingestion/batch/spark3/SparkSegmentGenerationJobRunner.java
Show resolved
Hide resolved
...-common/src/main/java/org/apache/pinot/common/segment/generation/SegmentGenerationUtils.java
Outdated
Show resolved
Hide resolved
pinot-spi/src/main/java/org/apache/pinot/spi/ingestion/batch/BatchConfig.java
Outdated
Show resolved
Hide resolved
a5082ad
to
da7aad0
Compare
Yeah. Removed the pinot-plugins.tar.gz |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, great job!
searchRecursively option is added in jobSpec. It is added to prevent the non-necessary recursive search for input files.
Currently, the option is set to true by default since that is existing behavior.
In next PR, we will change the default to be false after clients are aware of this change.