sflow v0.2.1 Release Notes
Release date: April 2026 Previous release: v0.2.0 (March 2026)
Highlights
sflow v0.2.1 is a documentation and workflow polish release for the InfMax v3 migration path. It documents the branch behavior for CSV-driven execution, self-contained YAML batch submission, replica variable domains, node placement, and probe orchestration.
User-Facing Changes
CLI and Batch Workflows
sflow run --bulk-inputnow has documented single-row CSV execution. Use--rowwith exactly one selector to run a specific CSV row.- Advanced
--rowselectors are documented forrun,compose, andbatch: repeated flags, comma lists, Python-style slices with exclusive end, open-ended slices, and negative indices such as--row=-1. sflow batch --bulk-submitis documented for submitting self-contained YAML files, folders, or glob patterns without CSV merging.- Auto-derived node counts are documented. Single-job and bulk-submit batch modes can derive
--nodesfrom the Slurm backend; bulk-input mode requires either--nodesor a CSV node-count column. --sflow-versionis documented for pinning the git ref installed by generated sbatch scripts.- Expression-aware
--sbatch-extra-argsis documented. Extra sbatch directives can resolve${{ variables.X }}or shorthand${{ X }}from config defaults, CLI--set, and CSV row values.
Variables and Replica Sweeps
- Variable domain metadata is documented through
${{ variables.NAME.domain }}. - Replica sweep behavior is clarified:
${{ variables.NAME }}resolves to the per-replica value, while${{ variables.NAME.domain }}remains the full domain list. - Domain overrides via
--setare documented: JSON-style list values update the variabledomain, and the variable value becomes the first list item.
Resources and Placement
resources.nodes.excludeis documented for removing nodes from the placement pool before applyingindices,count, or GPU packing.- Negative node indices are clarified, including the fact that negative
indicesare resolved afterexcludefiltering. - Default Slurm placement is documented: when a task does not set
resources.nodes, sflow passes the full backend allocation tosrun. - GPU packing behavior is documented, including multi-node expansion when a GPU request is an exact multiple of
gpus_per_node.
Probes
- Probe timing defaults are documented, including
timeout: 1200for readiness probes andeach_check_timeout: 30. - HTTP probes (
http_getandhttp_post) are documented with examples. - Multiple readiness probes are documented as AND semantics: all readiness probes must trigger before a task becomes ready.
- Failure probes are documented as fail-fast signals that mark tasks as failed by probe and cancel downstream work.
- Replica HTTP probe deduplication is documented for parallel replicas with identical HTTP probes.
Documentation Updated
docs/user/cli.mddocs/user/variables.mddocs/user/resources.mddocs/user/probes.mddocs/user/quick-reference.mddocs/user/configuration.mddocs/user/architecture.md