Inference Engine Vulnerability in vLLM Affects Large Language Model Processing
CVE-2026-54235
6.9MEDIUM
What is CVE-2026-54235?
The vLLM inference and serving engine for large language models has a vulnerability related to its temperature validation gates. Prior to version 0.23.1rc0, the software employed comparison operators for handling temperature values, which failed to accurately process NaN (Not a Number) and positive Infinity according to Python's IEEE 754 float semantics. This flaw prevents correct validation and allows these problematic values to propagate into GPU sampling kernels, resulting in undefined behavior and potential crashes of the inference worker. This issue has been rectified in the release of version 0.23.1rc0.
Affected Version(s)
vllm < 0.23.1rc0
