Overview#

Workflows turn complex computational pipelines into simple YAML definitions. You define what to run, how tasks connect, and what resources they need. OSMO handles the rest - scheduling, orchestration, and execution across your compute infrastructure.

What is a Workflow?#

Important

A workflow is a user-defined, directed acyclic graph (DAG) of tasks that is scheduled and executed by OSMO.

Key characteristics:

  • Workflows are defined in YAML and submitted via the CLI or Web UI.

  • Tasks execute in the order determined by their declared dependencies.

  • Serial, parallel, and combined execution patterns are supported.

  • Scheduling is handled automatically by OSMO.

Workflow Example
workflow:
  name: ml-pipeline
  tasks:
  - name: preprocess
    image: python:3.10
    command: ["python"]
    args: ["preprocess.py"]
    ...

  - name: train
    image: pytorch/pytorch
    command: ["python"]
    args: ["train.py"]
    ...
    inputs:
    - task: preprocess # (1)

  - name: evaluate
    image: python:3.10
    command: ["python"]
    args: ["evaluate.py"]
    ...
    inputs:
    - task: train # (2)

  - name: export-onnx
    image: python:3.10
    command: ["python"]
    args:
    - "export.py"
    - "--format=onnx"
    ...
    inputs:
    - task: train # (2)
  1. The task input specifies the upstream task dependency.

  2. Both evaluate and export-onnx depend only on train, so they run in parallel.

Resulting task graph: preprocess → train → (evaluate, export-onnx in parallel)

What is a Task?#

Important

Tasks are the fundamental units of work in OSMO. A task is an independent environment that runs a list of commands within a Docker container.

Capabilities:

  • 📂 Access local files, upstream task outputs, or cloud storage

  • 💻 Develop interactively with VSCode, Jupyter, or SSH

  • 🔐 Use managed secrets for secure credential access

  • 🖥️ Request specific hardware (GPU, CPU, RAM)

  • 🔁 Configure automatic retries for failures

  • And much more!

Example train task from the above workflow
- name: train
  image: pytorch/pytorch:2.0-cuda11.8

  # Task dependencies
  inputs:
  - task: preprocess

  # Secrets
  credentials:
    wandb_cred:
      WANDB_API_KEY: wandb_api_key # (1)

  # Execution
  command: ["python"]
  args:
  - "train.py"
  - "--data=/workspace/data"
  - "--checkpoint=/workspace/ckpt/base.pth"
  - "--output={{output}}" # (2)

  # Task outputs
  outputs: # (3)
  - url: s3://my-bucket/models/
  1. Use secrets for secure credential management.

  2. Writes to an output directory that OSMO recognizes for further processing.

  3. Uploads the output directory to S3 after completion.

What is a Group?#

Important

A group is a collection of tasks that are executed together. It synchronizes the execution of multiple tasks, enabling them to communicate within the same network.

Caution

The groups and tasks fields are mutually exclusive in a workflow.
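
A workflow therefore defines either top-level tasks or top-level groups, never both. Both forms are abbreviated below from the examples on this page:

Flat form (tasks) versus grouped form (groups)
# Flat form: top-level tasks, as in the ml-pipeline example
workflow:
  name: ml-pipeline
  tasks:
  - name: preprocess
    ...

# Grouped form: top-level groups, as in the my_workflow example below
workflow:
  name: my_workflow
  groups:
  - name: group_1
    tasks:
    - name: task_1
      lead: true
      ...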

How groups work:

  • A single task in a group is designated as the group leader

  • All tasks in a group start together

  • Tasks can communicate over the network

  • Tasks may run on the same node or across different nodes

  • Supports both homogeneous (e.g., all x86_64) and heterogeneous (e.g., x86_64 + ARM64) architectures

Common patterns:

  • Distributed training - Multiple workers with parameter servers

  • Multi-stage pipelines - Tasks that need real-time coordination

  • Service architectures - Long-running services with dependent workers

Groups Example
workflow:
  name: my_workflow
  groups:

  ################################################
  # Group 1 (runs first)
  ################################################

  - name: group_1
    tasks:
    - name: task_1
      lead: true # (1)
      ...
    - name: task_2
      ...
      outputs:
      - dataset:
          name: dataset_3 # (2)
    - name: task_3
      ...

  ################################################
  # Group 2 (runs after group 1)
  ################################################

  - name: group_2
    tasks:
    - name: task_4
      lead: true
      ...
      inputs:
      - dataset:
          name: dataset_3 # (3)

  ################################################
  # Group 3 (runs after group 1)
  ################################################

  - name: group_3
    tasks:
    - name: task_5
      lead: true
      ...
      inputs:
      - dataset:
          name: dataset_3
    - name: task_6 # (4)
      ...
  1. Every group must have one and only one lead task.

  2. task_2 outputs dataset_3, which is used as an input by other groups.

  3. group_2 runs after group_1 because of the dependency on dataset_3.

  4. Although task_6 has no direct dependency on dataset_3, its peer task (task_5) does.

    Therefore, group_3 must run after group_1.

Diagram: Group 1 consumes Dataset 1 and Dataset 2, runs Tasks 1–3, and produces Dataset 3. Group 2 consumes Dataset 3, runs Task 4, and produces Dataset 4. Group 3 consumes Dataset 3, runs Tasks 5 and 6, and produces Dataset 5.
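
As a concrete instance of the distributed-training pattern listed above, a group might pair a lead parameter-server task with worker tasks that communicate with it over the shared group network. This is only a sketch: the ps task name and the way workers address the lead task over the network are illustrative assumptions, not confirmed OSMO behavior.

Distributed training group (illustrative sketch)
workflow:
  name: distributed-training
  groups:
  - name: training_group
    tasks:
    # Lead task: a hypothetical parameter server that the workers connect to
    - name: ps
      lead: true
      image: pytorch/pytorch
      command: ["python"]
      args: ["serve_params.py"]
      ...
    # Worker tasks: assume the lead task is reachable by name on the group network
    - name: worker_1
      image: pytorch/pytorch
      command: ["python"]
      args: ["train_worker.py", "--server=ps"]
      ...
    - name: worker_2
      image: pytorch/pytorch
      command: ["python"]
      args: ["train_worker.py", "--server=ps"]
      ...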

See also

See here for the full workflow specification.