Performance
`ParquetWriter` already supports bloom filter encoding, but we still have to apply query clauses to the bloom filters during table scan.
Once we can build an external index file, we may also switch to an xor filter and its Rust implementation for better performance.
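To make the scan-side idea concrete, here is a minimal sketch of using per-row-group bloom filters to skip row groups for an equality clause. Everything in it is an assumption made for illustration: `BloomFilter`, `RowGroup`, `build_row_group`, `prune_row_groups`, and the `host` column are hypothetical names, and the toy filter uses the standard library hasher rather than Parquet's split-block bloom filter format or any real reader API.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Toy bloom filter over string values (illustrative only; this is not
/// the split-block format that Parquet writes).
struct BloomFilter {
    bits: Vec<bool>,
    num_hashes: u64,
}

impl BloomFilter {
    fn new(num_bits: usize, num_hashes: u64) -> Self {
        assert!(num_bits > 0 && num_hashes > 0);
        Self { bits: vec![false; num_bits], num_hashes }
    }

    /// Bit position of `value` under the `seed`-th hash function.
    fn index(&self, value: &str, seed: u64) -> usize {
        let mut hasher = DefaultHasher::new();
        seed.hash(&mut hasher);
        value.hash(&mut hasher);
        (hasher.finish() % self.bits.len() as u64) as usize
    }

    fn insert(&mut self, value: &str) {
        for seed in 0..self.num_hashes {
            let i = self.index(value, seed);
            self.bits[i] = true;
        }
    }

    /// `false` means the value is definitely absent;
    /// `true` means it may be present (false positives are possible).
    fn might_contain(&self, value: &str) -> bool {
        (0..self.num_hashes).all(|seed| self.bits[self.index(value, seed)])
    }
}

/// Hypothetical stand-in for one row group and the bloom filter
/// of its `host` column.
struct RowGroup {
    id: usize,
    host_filter: BloomFilter,
}

/// Builds a row group whose filter contains every value in `hosts`.
fn build_row_group(id: usize, hosts: &[&str]) -> RowGroup {
    let mut host_filter = BloomFilter::new(1024, 3);
    for host in hosts {
        host_filter.insert(host);
    }
    RowGroup { id, host_filter }
}

/// Returns the ids of row groups that may satisfy `host = value`.
/// Row groups whose filter rules the value out are skipped entirely.
fn prune_row_groups(groups: &[RowGroup], value: &str) -> Vec<usize> {
    groups
        .iter()
        .filter(|g| g.host_filter.might_contain(value))
        .map(|g| g.id)
        .collect()
}
```

For example, with one row group built from `["web-1", "web-2"]` and another from `["db-1"]`, `prune_row_groups(&groups, "db-1")` always includes the second row group, and usually excludes the first. Because a bloom filter can report false positives but never false negatives, only equality-style clauses (`=`, `IN`) can be pruned this way, and surviving row groups still need the real predicate applied.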
@v0y4g3r Any progress?
@v0y4g3r What's the plan for this issue? I am not sure if we still need it.
IMO, we should run some benchmarks later to compare it with the inverted index, since Parquet already supports bloom filters.
We have already implemented the data skipping index.