Handle fields missing in the source in ParquetNativeRecordReader #7742

Merged: 3 commits, Nov 11, 2021

npawar (Contributor) commented Nov 11, 2021

ParquetNativeRecordReader did not handle fields that are missing from the source. This case is common when transform functions are applied: the record reader receives a list of fields to read, and not all of them may exist in the source.
Modified the test to catch this case.
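
The behavior described above can be illustrated with a minimal sketch. This is not the code from this PR: the class `MissingFieldExtractorSketch`, the `extract` method, and the field names are all hypothetical, and a plain `Map` stands in for the Parquet record. It also assumes that a field absent from the source should end up as `null` in the output row instead of causing a failure.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for a record extractor; not Pinot's actual class.
class MissingFieldExtractorSketch {

  // Copies each requested field from the source record into a new row.
  // Assumption: a field that is absent from the source is stored as null
  // rather than triggering an exception.
  static Map<String, Object> extract(Map<String, Object> source, Set<String> fieldsToRead) {
    Map<String, Object> row = new HashMap<>();
    for (String field : fieldsToRead) {
      if (source.containsKey(field)) {
        row.put(field, source.get(field));
      } else {
        row.put(field, null);
      }
    }
    return row;
  }

  public static void main(String[] args) {
    // The source record only carries two columns.
    Map<String, Object> source = new HashMap<>();
    source.put("userId", 42);
    source.put("eventTimeMillis", 1636596840000L);

    // "eventDay" is a hypothetical column that a transform function would
    // derive later, so it is requested but does not exist in the source.
    Set<String> fieldsToRead =
        new LinkedHashSet<>(Arrays.asList("userId", "eventTimeMillis", "eventDay"));

    Map<String, Object> row = extract(source, fieldsToRead);
    System.out.println("userId = " + row.get("userId"));
    System.out.println("eventDay present = " + row.containsKey("eventDay")
        + ", value = " + row.get("eventDay"));
  }
}
```

Checking for the field before reading it keeps the extractor tolerant of field lists that are wider than the source schema, which is the situation transform functions create.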

@npawar npawar requested a review from mayankshriv November 11, 2021 02:14
codecov-commenter commented Nov 11, 2021

Codecov Report

Merging #7742 (941d659) into master (920093a) will increase coverage by 1.33%.
The diff coverage is 100.00%.

@@             Coverage Diff              @@
##             master    #7742      +/-   ##
============================================
+ Coverage     70.18%   71.52%   +1.33%     
- Complexity     4034     4061      +27     
============================================
  Files          1575     1577       +2     
  Lines         80289    80542     +253     
  Branches      11938    11967      +29     
============================================
+ Hits          56353    57608    +1255     
+ Misses        20061    19053    -1008     
- Partials       3875     3881       +6     
| Flag | Coverage Δ |
|---|---|
| integration1 | 29.24% <0.00%> (?) |
| integration2 | 27.84% <0.00%> (+0.09%) ⬆️ |
| unittests1 | 68.60% <ø> (+0.10%) ⬆️ |
| unittests2 | 14.57% <100.00%> (-0.02%) ⬇️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
|---|---|
| ...utformat/parquet/ParquetNativeRecordExtractor.java | 68.00% <100.00%> (+7.20%) ⬆️ |
| .../inputformat/protobuf/ProtoBufRecordExtractor.java | 78.65% <100.00%> (+1.12%) ⬆️ |
| ...ache/pinot/common/tier/PinotServerTierStorage.java | 70.00% <0.00%> (-30.00%) ⬇️ |
| .../segment/spi/compression/ChunkCompressionType.java | 80.00% <0.00%> (-20.00%) ⬇️ |
| ...che/pinot/common/utils/config/TierConfigUtils.java | 45.83% <0.00%> (-15.28%) ⬇️ |
| ...c/main/java/org/apache/pinot/common/tier/Tier.java | 88.88% <0.00%> (-11.12%) ⬇️ |
| ...inot/common/tier/TimeBasedTierSegmentSelector.java | 87.50% <0.00%> (-5.84%) ⬇️ |
| ...lix/core/realtime/PinotRealtimeSegmentManager.java | 78.75% <0.00%> (-5.19%) ⬇️ |
| ...ment/spi/loader/SegmentDirectoryLoaderContext.java | 70.00% <0.00%> (-5.00%) ⬇️ |
| ...or/impl/fwd/SingleValueVarByteRawIndexCreator.java | 80.95% <0.00%> (-3.26%) ⬇️ |
| ... and 133 more | |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 920093a...941d659.

@xiangfu0 xiangfu0 merged commit af01aa5 into apache:master Nov 11, 2021
xiangfu0 pushed a commit that referenced this pull request Nov 11, 2021
* Fix ParquetNativeRecordExtractor for fields missing in the source

* nit

* Same bug in proto
kriti-sc pushed a commit to kriti-sc/incubator-pinot that referenced this pull request Dec 12, 2021
…che#7742)

* Fix ParquetNativeRecordExtractor for fields missing in the source

* nit

* Same bug in proto
4 participants