Audio Processing Discrepancy in vLLM from Librosa Affects AI Model Outputs
CVE-2026-34760

5.9 MEDIUM

Key Information:

Vendor: Vllm-project
Product: Vllm
CVE Published: 2 April 2026


What is CVE-2026-34760?

The vLLM inference engine uses Librosa for audio processing. In vLLM versions 0.5.5 up to, but not including, 0.18.0, multi-channel audio is downmixed to mono with Librosa's default behavior, which averages the channels using numpy.mean, whereas the ITU-R BS.775-4 standard specifies a weighted downmixing algorithm. As a result, the audio an AI model processes can differ from what a user actually hears, which can affect the reliability of AI-driven audio applications. The issue is addressed in vLLM version 0.18.0.
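
The difference between the two downmixing approaches can be illustrated with a small, dependency-free Python sketch. It is not the vLLM patch and not Librosa's code: the function names, the channel order (L, R, C, Ls, Rs), and the weights are assumptions chosen for illustration, and the authoritative coefficients should be taken from ITU-R BS.775-4 itself. The first function mirrors what an unweighted mean across channels (as with numpy.mean over the channel axis) produces; the second applies a per-channel weight before summing.

```python
def downmix_mean(channels):
    """Unweighted mono downmix: average all channels sample by sample.

    `channels` is a list of equal-length sample lists, one per channel.
    This mirrors the effect of an unweighted mean over the channel axis.
    """
    n = len(channels)
    return [sum(frame) / n for frame in zip(*channels)]


def downmix_weighted(channels, weights):
    """Weighted mono downmix: scale each channel, then sum sample by sample."""
    if len(weights) != len(channels):
        raise ValueError("need exactly one weight per channel")
    return [
        sum(w * s for w, s in zip(weights, frame))
        for frame in zip(*channels)
    ]


# Hypothetical 5-channel input, assumed order: L, R, C, Ls, Rs.
# Sample 0 carries signal only in L and R; sample 1 only in C.
audio = [
    [1.0, 0.0],  # L
    [1.0, 0.0],  # R
    [0.0, 1.0],  # C
    [0.0, 0.0],  # Ls
    [0.0, 0.0],  # Rs
]

# Illustrative weights only (an assumption, not quoted from the standard).
ILLUSTRATIVE_WEIGHTS = [0.7071, 0.7071, 1.0, 0.5, 0.5]

mean_mono = downmix_mean(audio)
weighted_mono = downmix_weighted(audio, ILLUSTRATIVE_WEIGHTS)

print("mean:    ", mean_mono)
print("weighted:", weighted_mono)
```

With this input, the unweighted mean treats every channel equally and scales by the channel count, so both the overall level and the balance between the center and front channels differ from the weighted result. That is the kind of gap between model input and listener experience the advisory describes.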