NVIDIA Triton Processor

Triton Inference Server is NVIDIA’s open-source platform for deploying and managing AI models at scale. It supports popular frameworks like TensorFlow, PyTorch, ONNX etc. offering flexibility in model integration. With support for both CPU and GPU, Triton ensures optimized, high-performance inference. Ideal for production workloads, it enables efficient, low-latency AI serving across diverse environments.

Configure the processor parameters as explained below.


Model Selection

Provide the details of the model to be used.


Connection

Select the NVIDIA Triton Connection that you created in Gathr.

Model

Select the model version to be used in the processor.


Configuration Tab

Provide the below details in the Configuration tab.


Request JSON

Enter the inference request JSON. Use <GathrColumn>variable_name</GathrColumn> in your request to define variables that will be replaced with input dataset column values.

Input

The placeholders <GathrColumn>variable_name</GathrColumn> provided in the prompt can be mapped to columns to replace its value with the placeholder keys.

Input from Request JSON

Placeholders ___ provided in the Request JSON will be extracted here. Map them with the input dataset columns.

Input column

Select the column to replace the placeholder variable.

Automap

Automap placeholders provided in Request JSON with input dataset columns. The placeholders variable name should be same as input dataset columns for automapping.


Output

The output can be configured to emit data received from input.

Output Column

Specify the column where the entire response will be stored.


Validation

Next step is to provide the below details under the Validation tab.

Validate Output Using

Validate the output according to your needs. Choose the rows you want to validate.

  • Top 3 Rows: Quickly validate the first three rows of the output. This is good for a rapid overview.

  • Custom Rows: For more precise validation, you can manually select specific rows to validate. Simply click on the rows you want to include.

  • Random Rows: Comprehensively validate random rows in the output.

Once you’ve made your selection, click the Validate button to initiate the validation process. The processor will perform the validation according to your chosen rows.


Review and Confirm

Thoroughly review the validation results to confirm if they align with the desired outcome. Adjust and Revalidate (if Necessary).

If you identify any errors or inconsistencies, you can go back to the Nvidia processor’s configuration section and make adjustment as needed.

Once you’re satisfied with the validation results, you can proceed to the next step and save the configurations.

Top