Images

Open In Colab

Runhouse clusters expose various functions that allow you to set up state, dependencies, and whatnot on all nodes of your cluster. These include:

  • cluster.install_packages(...)

  • cluster.rsync(...)

  • cluster.set_env_vars(...)

  • cluster.run_bash(...)

A Runhouse “Image” is simply an abstraction that allows you to run several setup steps before we install runhouse and bring up the Runhouse daemon and initial set up on your cluster’s nodes. You can also specify a Docker image_id as the “base image” of your Runhouse image.

Here’s a simple example of using the Runhouse Image abstraction in your cluster setup:

import runhouse as rh image = ( rh.Image(name="sample_image") .from_docker("python:3.12.8-bookworm") .install_packages(["numpy", "pandas"]) .sync_secrets(["huggingface"]) .set_env_vars({"RH_LOG_LEVEL": "debug"}) ) cluster = rh.cluster(name="ml_ready_cluster", image=image, instance_type="CPU:2+", provider="aws").up_if_not()
I 12-17 12:04:55 provisioner.py:560] Successfully provisioned cluster: ml_ready_cluster
I 12-17 12:04:57 cloud_vm_ray_backend.py:3402] Run commands not specified or empty.
Clusters
AWS: Fetching availability zones mapping...NAME              LAUNCHED        RESOURCES                                                                  STATUS  AUTOSTOP  COMMAND
ml_ready_cluster  a few secs ago  1x AWS(m6i.large, image_id={'us-east-1': 'docker:python:3.12.8-bookwor...  UP      (down)    /Users/rohinbhasin/minico...

[?25h

The growing listing of setup steps available for Runhouse images is available in the API reference docs.