Audio Processing Discrepancy in vLLM from Librosa Affects AI Model Outputs
CVE-2026-34760
5.9MEDIUM
What is CVE-2026-34760?
The vLLM inference engine, which utilizes Librosa for audio processing, has a critical inconsistency in handling audio downmixing due to default settings in Librosa versions 0.5.5 to before 0.18.0. This results in audio processed by AI models differing from what is actually heard by users, due to the use of numpy.mean for mono downmixing, as opposed to the ITU-R BS.775-4 standard's weighted algorithm. This discrepancy affects the performance and reliability of AI-driven audio applications. The issue has been addressed in vLLM version 0.18.0.

Human OS v1.0:
Ageing Is an Unpatched Zero-Day Vulnerability.
Remediate biological technical debt. Prime Ageing uses 95% high-purity SIRT6 activation to maintain genomic integrity and bolster systemic resilience.
Affected Version(s)
vllm >= 0.5.5, < 0.18.0
