2363025 – (CVE-2025-46560) CVE-2025-46560 vllm: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service

Bug 2363025 (CVE-2025-46560) - CVE-2025-46560 vllm: vLLM phi4mm: Quadratic Time Complexity in Input Token Processing leads to denial of service

Summary: CVE-2025-46560 vllm: vLLM phi4mm: Quadratic Time Complexity in Input Token Pr...

Keywords:
Status:	NEW
Alias:	CVE-2025-46560
Product:	Security Response
Classification:	Other
Component:	vulnerability
Sub Component:
Version:	unspecified
Hardware:	All
OS:	Linux
Priority:	medium
Severity:	medium
Target Milestone:	---
Assignee:	Product Security DevOps Team
QA Contact:
Docs Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+	depends on / blocked

Reported:	2025-04-30 01:01 UTC by OSIDB Bzimport
Modified:	2025-04-30 10:07 UTC (History)
CC List:	1 user (show)
Fixed In Version:
Clone Of:
Environment:
Last Closed:
Embargoed:

Attachments	(Terms of Use)

Description OSIDB Bzimport 2025-04-30 01:01:11 UTC

vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.8.0 and prior to 0.8.5 are affected by a critical performance vulnerability in the input preprocessing logic of the multimodal tokenizer. The code dynamically replaces placeholder tokens (e.g., <|audio_|>, <|image_|>) with repeated tokens based on precomputed lengths. Due to inefficient list concatenation operations, the algorithm exhibits quadratic time complexity (O(n²)), allowing malicious actors to trigger resource exhaustion via specially crafted inputs. This issue has been patched in version 0.8.5.

Note You need to log in before you can comment on or make changes to this bug.