Bug 2376770 (CVE-2025-3044) - CVE-2025-3044 llama-index: MD5 Hash Collision in llama_index
Summary: CVE-2025-3044 llama-index: MD5 Hash Collision in llama_index
Keywords:
Status: NEW
Alias: CVE-2025-3044
Product: Security Response
Classification: Other
Component: vulnerability
Version: unspecified
Hardware: All
OS: Linux
medium
medium
Target Milestone: ---
Assignee: Product Security DevOps Team
QA Contact:
URL:
Whiteboard:
Depends On:
Blocks:
TreeView+ depends on / blocked
 
Reported: 2025-07-07 10:01 UTC by OSIDB Bzimport
Modified: 2025-07-08 13:42 UTC (History)
31 users (show)

Fixed In Version:
Clone Of:
Environment:
Last Closed:
Embargoed:


Attachments (Terms of Use)

Description OSIDB Bzimport 2025-07-07 10:01:45 UTC
A vulnerability in the ArxivReader class of the run-llama/llama_index repository, versions up to v0.12.22.post1, allows for MD5 hash collisions when generating filenames for downloaded papers. This can lead to data loss as papers with identical titles but different contents may overwrite each other, preventing some papers from being processed for AI model training. The issue is resolved in version 0.12.28.


Note You need to log in before you can comment on or make changes to this bug.