Visual Builder

class VisualBuilder

Builder class for visual encoder TensorRT engines. Handles the complete process of building TensorRT engines from ONNX models for visual encoders used in Vision-Language Models.

Public Functions

VisualBuilder(
std::filesystem::path const &onnxDir,
std::filesystem::path const &engineDir,
VisualBuilderConfig const &config
)

Constructor for VisualBuilder.

Parameters:
  • onnxDir – Directory containing the ONNX model and configuration files

  • engineDir – Directory where the built engine and related files will be saved

  • config – Configuration object specifying build parameters

~VisualBuilder() noexcept = default

Destructor.

bool build()

Builds the TensorRT engine from the ONNX model. This method performs the complete build process, including:

  • Loading and parsing the ONNX model

  • Setting up optimization profiles

  • Building the TensorRT engine

  • Copying necessary files to the engine directory

Returns:

true if the build succeeded, false otherwise
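As a rough illustration of how the constructor and build() fit together, the sketch below wraps them in a small driver function. The VisualBuilder and VisualBuilderConfig declarations here are stand-in stubs that only mirror the signatures documented above, so that the example compiles on its own; the stub build() does no real TensorRT work. The helper name buildVisualEngine and the chosen value for maxImageTokens are hypothetical and not part of the library.

```cpp
#include <cstdint>
#include <filesystem>
#include <iostream>
#include <system_error>

// Stand-in declarations (assumption): they mirror the documented signatures
// so this sketch is self-contained. In a real project, include the library's
// own header instead and delete this section.
struct VisualBuilderConfig
{
    int64_t minImageTokens{4};
    int64_t maxImageTokens{1024};
    int64_t maxImageTokensPerImage{512};
};

class VisualBuilder
{
public:
    VisualBuilder(std::filesystem::path const& onnxDir, std::filesystem::path const& engineDir,
        VisualBuilderConfig const& config)
        : mOnnxDir(onnxDir)
        , mEngineDir(engineDir)
        , mConfig(config)
    {
    }

    // Stub only: the real build() parses the ONNX model and builds the engine.
    // This placeholder just checks that its inputs look plausible.
    bool build()
    {
        std::error_code ec;
        return std::filesystem::is_directory(mOnnxDir, ec) && !mEngineDir.empty()
            && mConfig.minImageTokens <= mConfig.maxImageTokens;
    }

private:
    std::filesystem::path mOnnxDir;
    std::filesystem::path mEngineDir;
    VisualBuilderConfig mConfig;
};

// Hypothetical driver: configure a builder, run it, and report failure.
bool buildVisualEngine(std::filesystem::path const& onnxDir, std::filesystem::path const& engineDir)
{
    VisualBuilderConfig config;
    config.maxImageTokens = 2048; // illustrative value, not a recommendation

    VisualBuilder builder(onnxDir, engineDir, config);
    if (!builder.build())
    {
        std::cerr << "Visual engine build failed for " << onnxDir << '\n';
        return false;
    }
    return true;
}
```

Because build() reports failure through its return value, the caller has to check the result explicitly before using anything in the engine directory.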

struct VisualBuilderConfig

Configuration structure for visual model building. Contains parameters needed to configure the TensorRT engine building process for visual encoders used in Vision-Language Models.

Public Functions

inline Json toJson() const

Convert configuration to JSON format for serialization.

Returns:

JSON object containing all configuration parameters

inline std::string toString() const

Convert configuration to human-readable string format.

Returns:

String representation of the configuration for debugging/logging

Public Members

int64_t minImageTokens = {4}

Minimum number of image tokens in a batch.

int64_t maxImageTokens = {1024}

Maximum number of image tokens in a batch.

int64_t maxImageTokensPerImage = {512}

Maximum number of image tokens per image.
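The three token limits are related, so a quick sanity check before building can catch obvious mistakes. The following is a minimal sketch under stated assumptions: TokenLimits is a local mirror of the documented members (with the same defaults), isConsistent is a hypothetical helper, and the constraints it checks (positive values, minimum not above maximum, per-image limit not above the batch limit) are inferred from the member descriptions rather than taken from the library.

```cpp
#include <cstdint>

// Local mirror of the documented members and their defaults (assumption:
// used only so the sketch compiles without the library header).
struct TokenLimits
{
    int64_t minImageTokens{4};
    int64_t maxImageTokens{1024};
    int64_t maxImageTokensPerImage{512};
};

// Hypothetical check. The constraints are assumptions inferred from the
// member descriptions; the library may enforce different or additional rules.
bool isConsistent(TokenLimits const& limits)
{
    return limits.minImageTokens > 0
        && limits.minImageTokens <= limits.maxImageTokens
        && limits.maxImageTokensPerImage > 0
        && limits.maxImageTokensPerImage <= limits.maxImageTokens;
}
```

With the default values (4, 1024, 512), all of these conditions hold.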

Public Static Functions

static inline VisualBuilderConfig fromJson(Json const &json)

Create configuration from JSON format.

Parameters:

json – JSON object containing configuration parameters

Returns:

VisualBuilderConfig object with parsed parameters