Bug 2363025 (CVE-2025-46560) - CVE-2025-46560 vllm: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing​ leads to denial of service
Summary: CVE-2025-46560 vllm: vLLM phi4mm: Quadratic Time Complexity in Input Token Pr...
Keywords:
Status: NEW
Alias: CVE-2025-46560
Product: Security Response
Classification: Other
Component: vulnerability
Version: unspecified
Hardware: All
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Product Security DevOps Team
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2025-04-30 01:01 UTC by OSIDB Bzimport
Modified: 2025-04-30 10:07 UTC (History)
1 user (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments (Terms of Use)

Description OSIDB Bzimport 2025-04-30 01:01:11 UTC
vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal tokenizer. The code dynamically replaces placeholder tokens (e.g., <|audio_|>, <|image_|>) with repeated tokens based on precomputed lengths. Due to ​​inefficient list concatenation operations​​, the algorithm exhibits ​​quadratic time complexity (O(n²))​​, allowing malicious actors to trigger resource exhaustion via specially crafted inputs. This issue has been patched in version 0.8.5.


Note You need to log in before you can comment on or make changes to this bug.