This repository contains the necessary files to set up an Apache Spark cluster using Docker and Docker Compose.
- Docker
- Docker Compose
-
Clone this repository:
git clone https://github.com/isangwanrahul/spark-docker-setup.git cd spark-docker-setup
-
Create the
notebooks
directory:mkdir notebooks
-
Start the Spark cluster:
docker-compose up
- Spark Master: Available at http://localhost:8080
- Spark Worker 1: Available at http://localhost:8081
- Spark Worker 2: Available at http://localhost:8082
- Jupyter Notebook: Available at http://localhost:8888
To stop the Spark cluster, run:
docker-compose down