NVIDIA LLM inference stack for building TensorRT-based LLM engines and runtimes.
- Category
- LLM inference engine
- Layer
- Layer 3
- Maintainer
- NVIDIA
- Last reviewed
- 2026-06-21 UTC
Best-fit use
This profile is categorical orientation. It is not a ranking and should be validated against current official documentation before procurement or production selection.
Tags
Sources
- NVIDIA TensorRT-LLM Documentation — NVIDIA; accessed 2026-06-21 UTC.
