In this project I intend to show how to perform Linear Regression analysis on the Coimbra Cancer Data Set
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.
- Windows 7/8/10:
- Intellij IDEA
- Scala version <= 2.11.x
- Apache Spark version 2.x.x
- sbt version 1.x.x
A step by step series of examples that tell you how to get a development env running
- Download:
- Intellij IDEA
- sbt
- Spark: - After downloading Apache Spark, follow the instructions from here - Make sure to check that your installation works by running the command --> spark-shell in your native command line
- Intellij IDEA - The IDE used in the development of this application
- sbt - Dependency Management Tool
- Scala - Scala version 2.11.12
- Apache Spark - Data Analysis library
- Ermain Paul - Initial work - Github Profile
- Packt Publishing--Scala Machine Learning Project
- Apache Spark API Documentation
- Packt Publishing--Modern Scala Projects