Setting up RunPod

First part of 3-part series

This is the first part in a 3-part series about using RunPod to share Streamlit apps. Here are the first two parts:

Sometimes — often as of late — I need to test machine-learning models that require compute resources beyond what my meager laptop can handle. This necessitates a cloud server with computational resources (i.e., GPUs). My go-to service is RunPod. However, no matter how many times I use it, I always encounter some "gotcha" that makes my work take longer than it should. Each time this happens, I think, "ah crap, I should write that down." And then I promptly ignore myself and write nothing.

Well, not this time. These are my notes for setting up RunPod.

Pro Tip : Start with Storage

The perils of Pods

When I first started using RunPod, I would immediately go to the "Pods" page (it is called "RunPod" after all...), find a GPU powerful enough to complete my task, and then deploy it. Here's what the Pods page looks like as of this writing:

Alt Runpod Pods page = 450x — RunPod 'Pods' page

There are two problems with this approach:

Pods are expensive. For the kinds of tasks I'm running, I need at least 24GB of VRAM, which requires a GPU that costs around $.40/hr. If I just left this running, that would be $9.60/day, or $67.20/week. It adds up quick.
Pods can be stolen. RunPod has an option to "stop" a running Pod. This seems reasonable because I'm just testing a model and I'm not a maniac that wants to pay $67.20/week to keep the Pod live. HOWEVER, once I stop the Pod, the GPU becomes free for someone else to use. So, when I try to restart the Pod, I may find — and, if my experience is representative, almost always will find — that there are "Zero GPU Pods" available. I just spent maybe an hour getting my environment all set up on the Pod and now I can't even access the GPU.

The necessity of Network Volumes

A better approach is to first go to the "Storage" page:

Alt Runpod Storage page — RunPod 'Storage' Page

From here, click "New Network Volume", and make sure to set it up in a region that has availability for the required GPU (typically, I will keep the "Pods" page open in a separate tab to see the GPU specs and pricing). Then, attach the Network Volume to a Pod and set up an environment in /workspace.

Use /workspace

The /workspace folder persists on the volume, so make sure to set up code and environments there.

Why is this better?

Network Volumes are cheap. I currently have two network volumes for a couple of projects I'm running. Each one has 50GB of storage and costs only $3.50/month.
Network Volumes are versatile. Once I have my environment set up in /workspace, I can safely terminate any Pod connected to the Network Volume, knowing that the environment will persist on the Network Volume. Then, when I'm ready to run more tests, I can simply attach the Volume to another Pod.

How to connect a Pod

Once the Network Volume is created, connect a Pod from the Storage page (NOT the Pods page) — click on the Volume, then click on the button "Configure Pod with Volume"

Adding a git repository

Once the Network Volume is connected to a Pod, I then add the code I want to execute. I code locally with version control and push changes to a GitHub repository.

Public GitHub repo

To add a public GitHub repository to a Pod, simply clone the repository into /workspace:

git clone git@github.com:YOUR_REPOSITORY`

Private GitHub repo

For a private GitHub repo, the repo must be accessible first. The easiest way to do this is with a GitHub Deploy Key. Here's how:

Connect to the Pod from the terminal via SSH. When I click on the Pod, I can easily copy the code I need to connect. Just paste it into the terminal.
Create an SSH key on the Pod:

ssh-keygen -t ed25519 -C "runpod-demo"

When prompted, repeatedly hit enter to use the default behavior and not set a passphrase.

View the public key and copy it:

cat ~/.ssh/id_ed25519.pub

It looks something like this:

ssh-ed25519 AAAAC3NzaC1lZDI1NTE5AAAAI... runpod-demo

Add Deploy Key to GitHub:

Go to GitHub → Security → Deploy keys and click "Add deploy key."

Title it whatever (typically the same title chosen for the key; above it was "runpod-demo"), then paste the entire contents of the key into the Key box.

After the key is added, I should be able to clone the repository to the Pod (and pull changes)

Step 3: Set up the Pod

Once I have cloned the repo, I need to install dependencies. On RunPod, the default machine can come with older versions of Ubuntu (e.g., 20.04) and Python (e.g., 3.8). So, depending on the Python version I need, the first step may be to update Python.

1. Set Python version (optional)

On an older version of Ubuntu (e.g., 20.04), I'll install Python using deadsnakes PPA:

apt-get update
apt-get install -y software-properties-common
add-apt-repository -y ppa:deadsnakes/ppa
apt-get update
apt-get install -y python3.11 python3.11-venv python3.11-distutils

If I'm on a newer version of Ubuntu (e.g., 22.04+), I can skip the first 3 lines.

Note: This is assuming I want to work with Python 3.11 (and the Pod has something older). Just replace with the desired version.

2. Set up Python virtual environment

Then, I'll use my desired Python version to set up a virtual environment:

python3.11 -m venv /workspace/venvName
source /workspace/venvName/bin/activate
python --version   # sanity check: should show 3.11.x

Obviously, set the venvName for the specific project.

3. Install Python dependencies inside that virtual environment

pip install --upgrade pip wheel setuptools
cd /workspace/git-repo
pip install -r requirements.txt

This assumes I have already set up my requirements.txt and it's in the GitHub repo (/git-repo is the cloned git repository from before).

4. Install non-Python dependencies (optional)

I'm currently doing a lot of work in music information retrieval and I use FFmpeg for reading, writing, and converting audio files. Thus, I need to install this:

apt-get update
apt-get install -y ffmpeg

Now my Pod is ready for whatever Python project I want to throw at it. And, so long as I put everything in /workspace, I can safely terminate the Pod when I want to save some money, knowing I can reattach the Network Volume to another Pod later.