Model serving platform with model repositories, HTTP/gRPC APIs, schedulers, dynamic batching, and multiple backends.
- Category
- Model server
- Layer
- Layer 4
- Maintainer
- NVIDIA
- Last reviewed
- 2026-06-21 UTC
Best-fit use
This profile is categorical orientation. It is not a ranking and should be validated against current official documentation before procurement or production selection.
Tags
Sources
- NVIDIA Triton Inference Server Architecture — NVIDIA; accessed 2026-06-21 UTC.
