Download Latest Version v2.0.139 source code.tar.gz (3.3 MB)
Email in envelope

Get an email when there's a new version of Memvid

Home / v2.0.157
Name Modified Size InfoDownloads / Week
Parent folder
README.md 2026-02-15 2.7 kB
v2.0.157 source code.tar.gz 2026-02-15 3.3 MB
v2.0.157 source code.zip 2026-02-15 3.4 MB
Totals: 3 Items   6.7 MB 2

v2.0.157

Release Date: February 15, 2026

Overview

This release adds a structured XLSX extraction pipeline with table detection, OOXML metadata parsing, and semantic chunking. It also removes a vulnerable xlsx (SheetJS) dependency from the Node SDK, fixes the CLI deploy pipeline for proprietary crate handling, and includes clippy/lint fixes and documentation updates.


🚀 New Features

Structured XLSX Extraction Pipeline (memvid-core)

  • New XlsxReader::extract_structured() API for high-accuracy spreadsheet extraction
  • Automatic table boundary and header detection via heuristics and OOXML table definitions
  • Row-aligned semantic chunking that never splits rows across chunk boundaries
  • Formats rows as Header: Value | Header: Value pairs for optimal search accuracy
  • OOXML metadata parsing: number formats (dates, currency, percentages), merged cell regions, named table definitions
  • Column type inference (text, integer, float, date, currency, percentage, boolean)
  • Backward-compatible flat text output alongside structured chunks
  • New modules: xlsx_chunker, xlsx_ooxml, xlsx_table_detect

Remove Vulnerable xlsx Dependency — Issue [#198]

  • Removed SheetJS xlsx@0.18.5 from @memvid/sdk (CVE-2024-22363, CVE-2023-30533)
  • Production code already used ExcelJS — only example files were updated
  • Downstream users no longer receive Dependabot security alerts from @memvid/sdk

CLI Deploy Fix: Proprietary Crate Handling

  • Made memvid-ghostpack optional in memvid-ask-model and removed from workspace members
  • CI builds no longer fail when proprietary crates are absent (.gitignore'd)
  • Ghost model kind returns a clean error when the runtime is unavailable

🐛 Bug Fixes

  • Fixed clippy pedantic lints (implicit_clone, cast_possible_truncation)
  • Fixed dead_code warning for propagate_merged_cells
  • Resolved VecIndexManifest model field lint
  • xlsx_structured tests now gracefully skip on CI when fixture file is absent

📝 Documentation

  • Chinese (Simplified) README translation (#193 by @nightire)
  • README updates (@mo-omar-0197)

  • [#198] — Remove vulnerable xlsx (SheetJS) dependency (@intergrado-cg report, @Olow304 fix)
  • [#193] — Chinese README translation (@nightire, merged by @sharafdin)

🙏 Contributors

Thank you to all contributors who made this release possible:

  • @Olow304 — Structured XLSX pipeline, xlsx vulnerability fix, CLI deploy fix, clippy/lint cleanup
  • @nightire — Chinese (Simplified) README translation
  • @sharafdin — PR review and merge
  • @mo-omar-0197 — README updates
  • @intergrado-cg — Reported xlsx security vulnerability (#198)
Source: README.md, updated 2026-02-15