Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vdk-events: improve Productionizing Jupyter Notebooks README #2896

Merged
74 changes: 74 additions & 0 deletions events/productionizing-jupyter-notebooks/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,74 @@
# Productionizing Jupyter Notebooks with Versatile Data Kit (VDK)

## Table of Contents
- [Goals](#Goals)
- [Agenda](#Agenda)
- [Purpose](#Purpose)
- [Background](#Background)
* [Objective](#Objective)
- [Tutorial](#Tutorial)
- [Exercises](#Exercises)
- [Lessons Learned](#Lessons-Learned)
- [Feedback](#Feedback)
- [Where to Find Us](#Where-to-Find-Us)

## Presenter
[Duygu Hasan](https://github.com/duyguHsnHsn) <a href='https://www.linkedin.com/in/duygu-hasan/'><img src="https://img.shields.io/badge/LinkedIn-0077B5"></a>

## Goals
- Understand the challenges associated with transitioning notebooks to production.
- Introduce the VDK solutions to address these challenges.

## Agenda
- Discuss strategies for productionizing Jupyter Notebooks.
- Showcase hands-on examples of these challenges in the Jupyter UI.

## Purpose
The purpose of this scenario is to demonstrate how to operationalize Jupyter notebooks using the Versatile Data Kit (VDK) Jupyter integration. By the end of this guide, you'll understand how to:
* Create a data job with VDK within a Jupyter notebook.
* Write a data workflow in a notebook and make it ready to be put in a production environment.

## Background
### Objective:
The tutorial walks you through standard ETL process retrieve,
clean, transform, load data in a notebook environment and addresses
the challenges linked to making that notebook ready for production.


### Versatile Data Kit Jupyter Integration
For detailed instructions on working with VDK, please refer to the guide from the provided [link](../../projects/vdk-plugins/vdk-jupyter/getting-started.ipynb).

## Tutorial
### **🌟🚀 [CLICK HERE TO LAUNCH THE TUTORIAL!](https://mybinder.org/v2/gh/versatile-data-kit-demo/productionizing-jupyter-notebooks/HEAD?labpath=tutorial-job%2F10_notebook.ipynb)🚀🌟**

The link did not work?
Try this one out:

[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/versatile-data-kit-demo/productionizing-jupyter-notebooks/HEAD?labpath=tutorial-job%2F10_notebook.ipynb)


## Exercises
The tutorial-job directory contains the ready-to-use code. Make sure to explore it as it will provide hands-on experience with the objectives and VDK Jupyter integration discussed in this guide.

## Lessons Learned
**Congratulations! 🎉**

Throughout this scenario, you've:
* Explored the capabilities of the VDK Jupyter integration.
* Retrieved, cleaned, and processed data using Jupyter and VDK tools.
* Understood the process of ingesting data through VDK within a Jupyter environment.
* Understood the process of making notebooks ready for production with VDK.

## Your Feedback Matters!

We continuously strive to improve and your feedback is invaluable to us. Please take a moment to complete our survey. It will only take a few minutes.
### [**👉 Complete the Survey Here 👈**](https://forms.office.com/Pages/ResponsePage.aspx?id=yjiRs-48Skuk1s2D2d1i8AGV0VaygrpPnt7Tz5bBbeBUNFA5NkU3QzlNWEQyUFJCTTQwRUszWk9GUS4u)


## Where to Find Us
- [YouTube](https://www.youtube.com/channel/UCasf2Q7X8nF7S4VEmcTHJ0Q/about)
- [Twitter](https://twitter.com/vdkproject)
- [GitHub](https://github.com/vmware/versatile-data-kit)
- Relevant Links
- [An Overview of Versatile Data Kit](https://towardsdatascience.com/an-overview-of-versatile-data-kit-a812cfb26de7)
- [Community meeting: Productionizing Jupyter Notebooks with Versatile Data Kit](https://www.youtube.com/watch?v=U6M6UzsoiqY)
Binary file not shown.

This file was deleted.