FT Launcher & Inprocess integration

FT launcher integrates with Inprocess recovery mechanisms, improving fault tolerance by coordinating injob and inprocess fault recovery.

1. Heartbeat Mechanism

  • The FT launcher heartbeat remains active throughout execution to detect and mitigate potential hangs.

  • Users must configure timeouts manually, ensuring they exceed inprocess operational timeouts to prevent conflicts.

2. Restart Policy

The --ft-restart-policy argument is deprecated. Only any-failed is supported: the launcher restarts all workers when any worker group fails. Use of --ft-restart-policy may be removed in a future release.

Support for combining injob (FT launcher) and inprocess recovery in a single workload is being re-evaluated; a revised integration model may be documented in a future release.