-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PARQUET-2478: Update README with link to parquet website #1355
Conversation
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures. | ||
This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/) | ||
|
||
Apache Parquet is an open source, column-oriented data file format |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the same wording from apache/parquet-site#59
Update the introductory content to reduce confusion about parquet in general.
Parquet-MR contains the java implementation of the [Parquet format](https://github.com/apache/parquet-format). | ||
Parquet is a columnar storage format for Hadoop; it provides efficient storage and encoding of data. | ||
Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures. | ||
This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think "a" makes more sense here too, especially given the discussion around what constitutes a reference implementation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Lgtm!
You can see the rendered version of this README here:
https://github.com/alamb/parquet-mr/tree/alamb/website_link?tab=readme-ov-file#parquet-mr-
Jira
them in the PR title. For example, "PARQUET-1234: My Parquet PR"
the ASF 3rd Party License Policy.
Tests
Commits
There is a single commit with a self explanatory description
from "How to write a good git commit message":
Style
There are no code changes
mvn spotless:apply -Pvector-plugins
Documentation
This PR has no java code changes, only markdown