Skip to content

Latest commit

 

History

History
33 lines (22 loc) · 1.56 KB

README.md

File metadata and controls

33 lines (22 loc) · 1.56 KB

Open in Gitpod

Conveyor Samples

This repository contains a number of sample projects for Conveyor

Getting started

  1. Clone this repository
  2. Verify you have the Conveyor CLI installed by executing conveyor doctor
  3. Authenticate the Conveyor CLI conveyor auth login
  4. Create a new conveyor environment conveyor environment create --name samples
  5. Execute the additional steps listed in the prerequisites folder

Samples

Basic

  • pi_spark: use Apache Spark to calculate pi.
  • first_project_dbt: use dbt and DuckDB for the first time. Using this project is described in the Conveyor getting started guide.
  • coffee_shop_dbt: use dbt and DuckDB for cleaning and transforming the coffee shop input data and writing the results to S3.

MLOps

  • titanic: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict survival.
  • housing: use the dataset from Kaggle to build a ML pipeline that will produce a model to predict the housing prices.

Various

  • alerting: use native Apache Airflow features to send notifications to Slack.