This workshop will employ computational data science techniques covering the first 4 main steps of a typical data science pipeline:
- Acquisition
- Tidying/Manipulation
- Trasnformation
- Visualization/Communication
Attendees will be presented with healthcare data apply the taught methodologies to solve a representative problem. This workshop will use the R programming language with a major emphasis on the libraries within the Tidyverse. A keen understanding of functional programming languages and/or R is recommended but not required. The goal of the workshop is to give attendees the opportunity to deploy a near complete data science pipeline.
What you will need:
- Presentation:
slides.pdf
- Code:
import_tidy_transform_viz.Rmd
- R & RStudio
- PLEASE MAKE SURE YOU HAVE THIS SETUP
- It would be ideal to follow along locally, on your own machine
- Here is a TUTORIAL FOR WINDOWS & TUTORIAL FOR MAC that can help you get up and running :)
- If you do not have R & RStudio set up on your machine by the time the workshop rolls around, no fear - cloud is here! 🦸♀
- Cohort #1 - [INSERT RSTUDIO CLOUD LINK HERE]
- Cohort #2 - [INSERT RSTUDIO CLOUD LINK HERE]
- PLEASE MAKE SURE YOU HAVE THIS SETUP
Just a real, down-to-Mars girl, living in a material world, who took a midnight train going anywhere. Boring things: Statistical Programmer at the University of Miami Sylvester Comprehensive Cancer Center, R-Ladies Miami Organizer, Social Justice in Data Science Bookclub Organizer, and #IamRemarkable Facilitator. Not so boring: over-zealous plant-mom and wannabe wine connoisseur.
🐦: @Bouzoulay | My Blog | LinkedIn