Runtime directory profile

TensorRT-LLM

NVIDIA LLM inference stack for building TensorRT-based LLM engines and runtimes.

Status: Foundational Last verified: 2026-06-21 UTC Comparison: Scoped facts; no universal score

NVIDIA LLM inference stack for building TensorRT-based LLM engines and runtimes.

Category: LLM inference engine
Layer: Layer 3
Maintainer: NVIDIA
Last reviewed: 2026-06-21 UTC

Best-fit use

This profile is categorical orientation. It is not a ranking and should be validated against current official documentation before procurement or production selection.

Sources

NVIDIA TensorRT-LLM Documentation — NVIDIA; accessed 2026-06-21 UTC.

Maintenance record

Last materially changed: 2026-06-21 UTC
Last reviewed: 2026-06-21 UTC

Found an error, outdated capability, or unclear category boundary? Submit a correction with a supporting source.

Find runtime definitions and implementation guidance

TensorRT-LLM

Best-fit use

Tags

Sources

Maintenance record