19
19
20
20
# Ballista: Distributed SQL Query Engine, built on Apache Arrow
21
21
22
- Ballista is a distributed SQL query engine powered by the Rust implementation of [ Apache Arrow] ( arrow ) and
23
- [ DataFusion] ( datafusion ) .
22
+ Ballista is a distributed SQL query engine powered by the Rust implementation of [ Apache Arrow] [ arrow ] and
23
+ [ DataFusion] [ datafusion ] .
24
24
25
25
If you are looking for documentation for a released version of Ballista, please refer to the
26
- [ Ballista User Guide] ( user-guide ) .
26
+ [ Ballista User Guide] [ user-guide ] .
27
27
28
28
## Overview
29
29
@@ -38,14 +38,14 @@ Ballista implements a similar design to Apache Spark (particularly Spark SQL), b
38
38
Apache Spark in some cases, which means that more processing can fit on a single node, reducing the overhead of
39
39
distributed compute.
40
40
- The use of Apache Arrow as the memory model and network protocol means that data can be exchanged efficiently between
41
- executors using the [ Flight Protocol] ( flight ) , and between clients and schedulers/executors using the
42
- [ Flight SQL Protocol] ( flight-sql )
41
+ executors using the [ Flight Protocol] [ flight ] , and between clients and schedulers/executors using the
42
+ [ Flight SQL Protocol] [ flight-sql ]
43
43
44
44
## Features
45
45
46
46
- Supports HDFS as well as cloud object stores. S3 is supported today and GCS and Azure support is planned.
47
47
- DataFrame and SQL APIs available from Python and Rust.
48
- - Clients can connect to a Ballista cluster using [ Flight SQL] ( flight-sql ) .
48
+ - Clients can connect to a Ballista cluster using [ Flight SQL] [ flight-sql ] .
49
49
- JDBC support via Arrow Flight SQL JDBC Driver
50
50
- Scheduler web interface and REST UI for monitoring query progress and viewing query plans and metrics.
51
51
- Support for Docker, Docker Compose, and Kubernetes deployment, as well as manual deployment on bare metal.
@@ -97,7 +97,7 @@ The current focus is on the following items:
97
97
## Architecture Overview
98
98
99
99
There are currently no up-to-date architecture documents available. You can get a general overview of the architecture
100
- by watching the [ Ballista: Distributed Compute with Rust and Apache Arrow] ( ballista-talk ) talk from the New York Open
100
+ by watching the [ Ballista: Distributed Compute with Rust and Apache Arrow] [ ballista-talk ] talk from the New York Open
101
101
Statistical Programming Meetup (Feb 2021).
102
102
103
103
## Contribution Guide
0 commit comments