PyStributed: Remote Execution Utility

This package allows users to mark specific code cells in a Jupyter Notebook for remote execution. Once marked, the code will be packaged into a Docker container, sent to a specified remote server for execution, and the results will be fetched back to the local machine.

Installation

Extract the contents of this directory to a location on your machine.
Navigate to the directory and run pip install . to install the package.
This package can also be installed using pip install pystributed.

Prerequisites

Docker installed on both local and remote machines.
SSH access to the remote machine.
PyTorch and Transformers libraries if you are using them in your code.

Configuration

Before using the package, you need to set up some configurations in config.py. Template can be seen below.
docker_utils.py is used to set docker file and container environment variables
remote_utils.py is used to set docker container volume and host save location

Usage

In your Jupyter Notebook, import the package:

import pystributed.main as runner

Use the %%save_for_remote magic command to mark the code cell you want to run remotely:

%%save_for_remote

# Your code here
# For example:
import torch
model = torch.load('my_model.pth')
result = model(some_data)

After marking the desired code cell, run the cell to obtain results.

Under the Hood

The package works in the following sequence:

The code cell marked with %%save_for_remote is saved to a Python script (temp_script.py by default).
A Docker image is built with the user's code and necessary dependencies.
The Docker image is pushed to the specified Docker registry.
The package SSHs into the specified remote server, pulls the Docker image, and runs it.
Once the code execution is complete on the remote server, the results are fetched and saved to the local machine.

Configuration

To customize the configuration for pystributed, create a config.json file in the following location:

~/.pystributed/config.json

Here's a detailed structure for the config.json file:

{
    "BASE_IMAGE": "Name of the base docker image you want to use",
    "PACKAGES_NEEDED": "Packages needed to execute code snippet",
    "DOCKER_IMAGE_NAME": "Name of the Docker image that will be created.",
    "DOCKER_REGISTRY": "Docker registry where the image will be pushed.",
    "REMOTE_SERVER": "SSH-compatible address of your remote server (e.g., `user@remote_server_ip`).",
    "REMOTE_WORKDIR": "Working directory on the remote server where results will be stored.",
    "USER_CODE_PATH": "Temporary path on the local machine where the user's code will be saved before packaging.",
    "SSH_PRIVATE_KEY_PATH": "Location of .pem file to access remote instance",
    "GPU": "Set to 'True' if you are using a remote instance that requires your docker container to access the GPU on the instance"
}

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
build/lib/pystributed		build/lib/pystributed
pystributed.egg-info		pystributed.egg-info
pystributed		pystributed
Dockerfile		Dockerfile
MANIFEST.in		MANIFEST.in
README.md		README.md
Sample_notebook.ipynb		Sample_notebook.ipynb
sample_config.json		sample_config.json
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyStributed: Remote Execution Utility

Installation

Prerequisites

Configuration

Usage

Under the Hood

Configuration

About

Releases

Packages

Languages

evan-anthony/pystributed

Folders and files

Latest commit

History

Repository files navigation

PyStributed: Remote Execution Utility

Installation

Prerequisites

Configuration

Usage

Under the Hood

Configuration

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages