1. Introduction

This repository contains the source code for our OSDI'20 paper "PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications". In addition, it contains some codes for systems which are compared with our system.

2. Contents

PipeSwitch: Our proposed system.
Ready model: The process with the required model is already loaded in the GPU. This solution provides the lower bound, which is the lowest latency we can achieve for an inference task.
Kill and restart: It stops the training task in the GPU, and then starts the inference task.
PyTorch plugins: Modified PyTorch files, which are necessary for running PipeSwitch.
Tasks: Models used for our evaluations.
Util: Some common functions. For example, establishing TCP connections.
Clients: Clients for sending requests.

Compile PyTorch for PipeSwitch. Ready-model and kill-and-restart could use original PyTorch.
Start the server you are interested, which can be PipeSwitch, ready-model or kill-and-restart.
Start the client to send requests.

More details are included in README under folders for each system.

If you have any question, please contact zbai1 at jhu dot edu

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
client		client
kill_restart		kill_restart
pipeswitch		pipeswitch
pytorch_plugin		pytorch_plugin
ready_model		ready_model
task		task
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md