vllm

Utility to convert a Model Optimizer exported model to a vLLM checkpoint.

Functions

export_to_vllm – Exports the torch model to a vLLM checkpoint and saves it to export_path.

export_to_vllm(model, tokenizer, export_path='/tmp')

Exports the torch model to a vLLM checkpoint and saves it to export_path.

Parameters:

model (Module) – the torch model
tokenizer (Module) – the tokenizer used for the model
export_path (Path | str) – path where the vLLM-compatible quantized checkpoint is written (default: '/tmp')
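
The call pattern described above could be wrapped as in the following minimal sketch. The helper name export_and_list is hypothetical and not part of Model Optimizer; it takes the export function as an argument so the snippet runs without the library installed. The import path shown in the trailing comment is an assumption and may differ between releases.

```python
from pathlib import Path
from typing import Callable, List, Union


def export_and_list(
    model,
    tokenizer,
    export_path: Union[str, Path],
    export_fn: Callable,
) -> List[str]:
    """Hypothetical helper: run an export function and list what it wrote.

    export_fn is expected to accept (model, tokenizer, export_path=...),
    matching the export_to_vllm signature documented above.
    """
    out_dir = Path(export_path)
    # Create the target directory up front; harmless if it already exists.
    out_dir.mkdir(parents=True, exist_ok=True)
    export_fn(model, tokenizer, export_path=out_dir)
    # Return the names of the entries now present in the export directory.
    return sorted(p.name for p in out_dir.iterdir())


# Usage sketch (left commented out: it needs Model Optimizer installed and
# an already-quantized torch model; the import path is an assumption):
# from modelopt.torch.export.vllm import export_to_vllm
# files = export_and_list(quantized_model, tokenizer, "/tmp/vllm_ckpt", export_to_vllm)
```

Passing a dedicated directory rather than relying on the '/tmp' default keeps the checkpoint files separate from unrelated temporary files.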