Visual Builder
-
class VisualBuilder
Builder class for visual encoder TensorRT engines. Handles the complete process of building TensorRT engines from ONNX models for visual encoders used in Vision-Language Models.
Public Functions
-
VisualBuilder(std::filesystem::path const &onnxDir, std::filesystem::path const &engineDir, VisualBuilderConfig const &config)
Constructor for VisualBuilder.
- Parameters:
onnxDir – Directory containing the ONNX model and configuration files
engineDir – Directory where the built engine and related files will be saved
config – Configuration object specifying build parameters
-
~VisualBuilder() noexcept = default
Destructor.
-
bool build()
Build the TensorRT engine from the ONNX model. This method performs the complete build process, including:
- Loading and parsing the ONNX model
- Setting up optimization profiles
- Building the TensorRT engine
- Copying necessary files to the engine directory
- Returns:
true if build was successful, false otherwise
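The constructor and build() are typically used together. The following is a minimal sketch of that call sequence, not the library's own example. The VisualBuilderConfig and VisualBuilder definitions here are stand-ins that mirror the documented signatures so the snippet compiles on its own; the stub build() only checks that the ONNX directory exists, and buildVisualEngine is a hypothetical helper name. In real code, include the library header that declares these types instead of the stand-ins.

```cpp
#include <cassert>
#include <cstdint>
#include <filesystem>
#include <iostream>
#include <system_error>

// Stand-in for the documented struct (same members and defaults).
struct VisualBuilderConfig {
    int64_t minImageTokens{4};
    int64_t maxImageTokens{1024};
    int64_t maxImageTokensPerImage{512};
};

// Stand-in for the documented class (same public signatures).
class VisualBuilder {
public:
    VisualBuilder(std::filesystem::path const& onnxDir,
                  std::filesystem::path const& engineDir,
                  VisualBuilderConfig const& config)
        : mOnnxDir(onnxDir), mEngineDir(engineDir), mConfig(config) {}

    // Stub only: the real build() parses the ONNX model and builds the
    // engine. This placeholder just reports whether onnxDir is a directory.
    bool build() {
        std::error_code ec;
        return std::filesystem::is_directory(mOnnxDir, ec);
    }

private:
    std::filesystem::path mOnnxDir;
    std::filesystem::path mEngineDir;
    VisualBuilderConfig mConfig;
};

// Hypothetical helper: configure, build, and surface failure to the caller.
bool buildVisualEngine(std::filesystem::path const& onnxDir,
                       std::filesystem::path const& engineDir) {
    VisualBuilderConfig config;
    config.maxImageTokens = 2048;          // raise the per-batch limit
    config.maxImageTokensPerImage = 1024;  // raise the per-image limit

    VisualBuilder builder(onnxDir, engineDir, config);
    if (!builder.build()) {
        // build() signals failure through its return value, so check it.
        std::cerr << "Visual engine build failed for " << onnxDir << '\n';
        return false;
    }
    return true;
}
```

Because build() reports failure through its bool return value, callers should check the result before assuming the engine directory is populated.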
-
struct VisualBuilderConfig
Configuration structure for visual model building. Contains parameters needed to configure the TensorRT engine building process for visual encoders used in Vision-Language Models.
Public Functions
-
inline Json toJson() const
Convert configuration to JSON format for serialization.
- Returns:
JSON object containing all configuration parameters
-
inline std::string toString() const
Convert configuration to human-readable string format.
- Returns:
String representation of the configuration for debugging/logging
Public Members
-
int64_t minImageTokens = {4}
Minimum number of image tokens in a batch.
-
int64_t maxImageTokens = {1024}
Maximum number of image tokens in a batch.
-
int64_t maxImageTokensPerImage = {512}
Maximum number of image tokens per image.
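The three limits above are related, so it can help to sanity-check them before starting a build. The sketch below assumes the natural ordering (a positive minimum, a minimum no larger than the batch maximum, and a per-image maximum no larger than the batch maximum). The source does not state that the library enforces these constraints; VisualTokenLimits and limitsAreConsistent are hypothetical names that mirror the documented members and defaults.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical mirror of the three documented members and their defaults.
struct VisualTokenLimits {
    int64_t minImageTokens{4};
    int64_t maxImageTokens{1024};
    int64_t maxImageTokensPerImage{512};
};

// Assumed consistency rules (not confirmed by the library documentation):
//   0 < minImageTokens <= maxImageTokens
//   0 < maxImageTokensPerImage <= maxImageTokens
constexpr bool limitsAreConsistent(VisualTokenLimits const& limits) {
    return limits.minImageTokens > 0
        && limits.minImageTokens <= limits.maxImageTokens
        && limits.maxImageTokensPerImage > 0
        && limits.maxImageTokensPerImage <= limits.maxImageTokens;
}

// The documented defaults (4, 1024, 512) satisfy the assumed rules.
static_assert(limitsAreConsistent(VisualTokenLimits{}),
              "default limits should be consistent");
```

A check like this could run before constructing the builder, so that a misconfigured range is caught before the build starts.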
Public Static Functions
-
static inline VisualBuilderConfig fromJson(Json const &json)
Create configuration from JSON format.
- Parameters:
json – JSON object containing configuration parameters
- Returns:
VisualBuilderConfig object with parsed parameters