This repository was archived by the owner on Mar 30, 2021. It is now read-only.

Home

sdesikan6 edited this page Jan 31, 2016 · 8 revisions

# Fast BI using Spark and Druid

This project is aimed at two classes of users:

- Druid users who want SQL access to their indexes and want to use traditional BI tools such as Tableau with Druid.
- Spark and Hive users who find their interactive BI queries painfully slow.

Where to start:

- Quick Start
- The Druid project
- Spark

## Indexing

- Indexing TPC-H data as an example.
- Sample indexes for an ad impression reporting example.
- Setting up Druid.

Setting up the data:

- Sample data set for TPC-H.
- Ad impressions example.
- Star schema.

## Querying data from Spark

- Set up Thrift server connections so you can use SQuirreL SQL, RazorSQL, Zeppelin, or Tableau against the datasets.
- Sample queries.
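
The Spark Thrift server speaks the HiveServer2 protocol, so JDBC-based clients generally connect with a `jdbc:hive2://` URL. As a rough sketch of what goes into the connection dialog of such a tool, the hypothetical helper below assembles that URL. The host, the database name, and the function itself are illustrative assumptions; port 10000 is the usual default, but your deployment may differ.

```python
def thrift_jdbc_url(host: str = "localhost",
                    port: int = 10000,
                    database: str = "default") -> str:
    """Build a HiveServer2-style JDBC URL for a Spark Thrift server.

    All defaults are assumptions for illustration: adjust the host,
    port, and database to match your own deployment.
    """
    if not host:
        raise ValueError("host must be a non-empty string")
    if not 0 < port < 65536:
        raise ValueError("port must be between 1 and 65535")
    return f"jdbc:hive2://{host}:{port}/{database}"


if __name__ == "__main__":
    # Paste the printed URL into the JDBC connection settings of your BI tool.
    print(thrift_jdbc_url())
```

The resulting string is what you would typically enter as the connection URL, together with the Hive JDBC driver, in the client of your choice.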