From 94120614998383db61534fd0debf51c1f3556bce Mon Sep 17 00:00:00 2001 From: Andrew Lamb Date: Sun, 19 May 2024 06:10:37 -0400 Subject: [PATCH] PARQUET-2478: Update README with link to parquet website Update the introductory content to reduce confusion about parquet in general. --- README.md | 16 ++++++++++++---- 1 file changed, 12 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index fa947935ba..74987cb943 100644 --- a/README.md +++ b/README.md @@ -20,11 +20,19 @@ Parquet MR [![Build Status](https://github.com/apache/parquet-mr/workflows/Test/badge.svg)](https://github.com/apache/parquet-mr/actions) ====== -Parquet-MR contains the java implementation of the [Parquet format](https://github.com/apache/parquet-format). -Parquet is a columnar storage format for Hadoop; it provides efficient storage and encoding of data. -Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures. +This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/) + +Apache Parquet is an open source, column-oriented data file format +designed for efficient data storage and retrieval. It provides high +performance compression and encoding schemes to handle complex data in +bulk and is supported in many programming language and analytics +tools. -You can find some details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013) +The [parquet-format](https://github.com/apache/parquet-format) +repository contains the file format specificiation. + +Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures. +You can find additional details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013) ## Building