Memory Overflow Vulnerability in NVIDIA Triton Inference Server for Windows and Linux
CVE-2025-23320
7.5HIGH
What is CVE-2025-23320?
A memory overflow vulnerability exists in the Python backend of NVIDIA Triton Inference Server for both Windows and Linux platforms. This issue allows attackers to exploit the server by sending excessively large requests, potentially causing the shared memory limits to be exceeded. If successfully exploited, the vulnerability could lead to sensitive information being disclosed, compromising the security and integrity of the data processed by the server.
Affected Version(s)
Triton Inference Server Windows All versions prior to 25.07