CloudFormation templates for setting up Amazon EMR clusters.

Features

  • Infra layer with IAM roles, S3 buckets and security groups
  • EMR cluster with JupyterLab & Zeppelin notebook environments

Prerequisites

You'll need the VPC and bucket stacks from sjakthol/aws-account-infra.

Deployment

Infra

Deploy infra (buckets, roles, policies, security groups etc.):

make deploy-infra-emr

Clusters

Deploy EMR clusters:

# Default cluster
make deploy-cluster

# Specific EMR version (supports EMR versions that use Amazon Linux 2)
make deploy-cluster-emr-5.36.0
make deploy-cluster-emr-6.7.0
make deploy-cluster-emr-x.x.x
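
Cluster startup can take several minutes. As a minimal sketch, assuming you already know the EMR cluster ID, the AWS CLI waiter can block until the cluster is up. The `run` and `wait_for_cluster` helpers and the `DRY_RUN` variable are illustrative names and are not part of this repository.

```shell
# Sketch only: block until an EMR cluster has started.
# Set DRY_RUN=1 to print the AWS CLI command instead of executing it.
run() {
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 is the EMR cluster ID (e.g. j-XXXXXXXXXXXXX, a placeholder).
wait_for_cluster() {
  run aws emr wait cluster-running --cluster-id "$1"
}
```

For example, `DRY_RUN=1 wait_for_cluster j-XXXXXXXXXXXXX` prints the command without contacting AWS.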

When the cluster is ready, use AWS Systems Manager Session Manager to access services running on the cluster (requires the Session Manager plugin for the AWS CLI). Run

./scripts/setup-port-forwarding.sh <master_instance_id>

where <master_instance_id> is the EC2 instance ID of the EMR cluster master node. The script establishes a connection to services running on that node.
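
The following is a hedged sketch of how the master node ID could be looked up and how a single port could be forwarded over Session Manager with the AWS CLI. It is not the contents of `setup-port-forwarding.sh`. The function names are hypothetical, the lookup assumes the cluster uses instance groups rather than instance fleets, and any port number you pass is your own choice rather than a value taken from this repository.

```shell
# Sketch only: helpers around Session Manager port forwarding.

# Build the JSON parameters for the AWS-StartPortForwardingSession document.
# $1 = remote port on the node, $2 = local port (defaults to the remote port).
port_forward_params() {
  remote_port="$1"
  local_port="${2:-$1}"
  printf '{"portNumber":["%s"],"localPortNumber":["%s"]}' "$remote_port" "$local_port"
}

# Print the EC2 instance ID of the master node for an EMR cluster ID.
# Assumes the cluster uses instance groups.
master_instance_id() {
  aws emr list-instances \
    --cluster-id "$1" \
    --instance-group-types MASTER \
    --query 'Instances[0].Ec2InstanceId' \
    --output text
}

# Forward a local port to the same (or another) port on the given instance.
# $1 = instance ID, $2 = remote port, $3 = optional local port.
forward_port() {
  aws ssm start-session \
    --target "$1" \
    --document-name AWS-StartPortForwardingSession \
    --parameters "$(port_forward_params "$2" "${3:-$2}")"
}
```

With these helpers, `master_instance_id j-XXXXXXXXXXXXX` (a placeholder cluster ID) prints an instance ID that can be passed either to `forward_port` or to `./scripts/setup-port-forwarding.sh`.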

Cleanup

Delete clusters:

# Default cluster
make delete-cluster

# Cluster created with specific EMR version
make delete-cluster-emr-5.36.0
make delete-cluster-emr-6.7.0
make delete-cluster-emr-x.x.x

Delete infra (you must first empty the S3 buckets and remove the EMR-managed security group rules manually):

make delete-infra-emr
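
The manual bucket cleanup can be sketched with the AWS CLI as shown below. The bucket name is a placeholder and the `run` and `empty_bucket` helpers are illustrative, not part of this repository. The sketch covers only the S3 step, so the security group rules still need to be removed separately. On a versioned bucket, `aws s3 rm` removes current objects only, so older versions and delete markers would need additional handling.

```shell
# Sketch only: empty an S3 bucket before deleting the infra stack.
# Set DRY_RUN=1 to print the AWS CLI command instead of executing it.
run() {
  if [ -n "${DRY_RUN:-}" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# $1 is the bucket name (placeholder; use the buckets created by the infra stack).
empty_bucket() {
  run aws s3 rm "s3://$1" --recursive
}
```

Running `DRY_RUN=1 empty_bucket my-emr-bucket` prints the command so it can be reviewed before anything is deleted.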

Credits

See Also

License

MIT