Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider using Cloudera Director for cluster provisioning #44

Closed
laserson opened this issue Apr 22, 2015 · 2 comments
Closed

Consider using Cloudera Director for cluster provisioning #44

laserson opened this issue Apr 22, 2015 · 2 comments
Assignees

Comments

@laserson
Copy link
Contributor

The Spark EC2 scripts have variable quality depending on instance types and options that are set. Also, they mainly just set up Spark, whereas we may want some of the other tools in the Hadoop stack (e.g., @tomwhite's partitioning tool that uses Crunch/Hadoop 2.x). Cloudera Director may make it more reliable to set up the whole stack, and also may more easily support using alternate clouds as well.

@tomwhite
Copy link
Member

I created #45 for this. This change allows you to bring up a cluster using Cloudera Director - there's still more follow-on work to install eggo on the gateway node (need to change setup_master it to run as a non-root user).

@tomwhite
Copy link
Member

Merged #45

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants