Skip to content
This repository was archived by the owner on Mar 30, 2021. It is now read-only.
sdesikan6 edited this page Jan 31, 2016 · 8 revisions

#Fast BI using Spark and Druid.

This project is aimed at two classes of users

  1. Users of Druid who want SQL access to their indexes and use traditional BI tools such as Tableau with Druid
  2. Spark and Hive users who find performance of their interactive BI painfully slow.

Where to start.

  1. Quick Start
  2. The Druid project
  3. Spark

##Indexing

  1. Indexing TPCH data as an example.

  2. Sample Indexes for an Ad Impression reporting example.

  3. Setting up Druid Druid.

Setting up the data

  1. Sample data set for TPCH.

  2. Ad Impressions example

  3. Star schema.

##Querying data from Spark

  1. Setup thrift server connections so you can use Squirrel, Razor SQL, Zeppelin or Tableau against the datasets.

  2. Sample Queries.

Clone this wiki locally