data#

Data layer wrapping PipelineStore for the dashboard.

DashboardStore is a param.Parameterized adapter that queries the SQLite database and exposes results as pandas DataFrames suitable for Panel reactive updates.

Classes#

DashboardStore

Reactive wrapper around PipelineStore.

Module Contents#

class physicsnemo_curator.dashboard.data.DashboardStore(db_path: str, **kwargs: Any)#

Bases: param.Parameterized

Reactive wrapper around PipelineStore.

Provides pandas DataFrame views of pipeline metrics and supports auto-refresh for live monitoring of running pipelines.

Initialize the dashboard store.

Parameters:
  • db_path (str) – Path to an existing PipelineStore SQLite database.

  • **kwargs (Any) – Additional param keyword arguments.

all_artifacts() dict[str, list[str]]#

Return all filter artifacts across all indices.

Returns:

Mapping of filter name to list of all artifact paths.

Return type:

dict[str, list[str]]

artifacts(index: int) dict[str, list[str]]#

Return filter artifacts for a given index.

Parameters:

index (int) – Pipeline source index.

Returns:

Mapping of filter name to list of artifact paths.

Return type:

dict[str, list[str]]

output_paths(index: int) list[str]#

Return output file paths for a given index.

Parameters:

index (int) – Pipeline source index.

Returns:

Ordered list of output file paths.

Return type:

list[str]

property index_df: pandas.DataFrame#

DataFrame of per-index results.

Columns: index, status, wall_time_s, peak_memory_mb, gpu_memory_mb, error.

Returns:

One row per processed index.

Return type:

pd.DataFrame

property pipeline_config: dict#

Return the pipeline configuration dictionary.

Returns:

Pipeline configuration as stored in the database.

Return type:

dict

refresh#
refresh_interval#
selected_index#
property stage_df: pandas.DataFrame#

DataFrame of per-stage timing for all indices.

Columns: index, stage_name, stage_order, wall_time_s.

Returns:

One row per (index, stage) combination.

Return type:

pd.DataFrame

property summary: dict[str, Any]#

Summary of the pipeline run state.

Returns:

Keys: total, completed, failed, remaining, elapsed_s, config_hash, db_path, workers.

Return type:

dict[str, Any]

property workers_df: pandas.DataFrame#

DataFrame of registered workers.

Columns: worker_id, pid, hostname, started_at, last_heartbeat, current_index.

Returns:

One row per worker.

Return type:

pd.DataFrame