| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2026-02-15 | 2.7 kB | |
| v2.0.157 source code.tar.gz | 2026-02-15 | 3.3 MB | |
| v2.0.157 source code.zip | 2026-02-15 | 3.4 MB | |
| Totals: 3 Items | 6.7 MB | 2 | |
v2.0.157
Release Date: February 15, 2026
Overview
This release adds a structured XLSX extraction pipeline with table detection, OOXML metadata parsing, and semantic chunking. It also removes a vulnerable xlsx (SheetJS) dependency from the Node SDK, fixes the CLI deploy pipeline for proprietary crate handling, and includes clippy/lint fixes and documentation updates.
🚀 New Features
Structured XLSX Extraction Pipeline (memvid-core)
- New
XlsxReader::extract_structured()API for high-accuracy spreadsheet extraction - Automatic table boundary and header detection via heuristics and OOXML table definitions
- Row-aligned semantic chunking that never splits rows across chunk boundaries
- Formats rows as
Header: Value | Header: Valuepairs for optimal search accuracy - OOXML metadata parsing: number formats (dates, currency, percentages), merged cell regions, named table definitions
- Column type inference (text, integer, float, date, currency, percentage, boolean)
- Backward-compatible flat text output alongside structured chunks
- New modules:
xlsx_chunker,xlsx_ooxml,xlsx_table_detect
Remove Vulnerable xlsx Dependency — Issue [#198]
- Removed SheetJS
xlsx@0.18.5from@memvid/sdk(CVE-2024-22363, CVE-2023-30533) - Production code already used ExcelJS — only example files were updated
- Downstream users no longer receive Dependabot security alerts from
@memvid/sdk
CLI Deploy Fix: Proprietary Crate Handling
- Made
memvid-ghostpackoptional inmemvid-ask-modeland removed from workspace members - CI builds no longer fail when proprietary crates are absent (
.gitignore'd) - Ghost model kind returns a clean error when the runtime is unavailable
🐛 Bug Fixes
- Fixed clippy pedantic lints (
implicit_clone,cast_possible_truncation) - Fixed
dead_codewarning forpropagate_merged_cells - Resolved VecIndexManifest model field lint
xlsx_structuredtests now gracefully skip on CI when fixture file is absent
📝 Documentation
- Chinese (Simplified) README translation (#193 by @nightire)
- README updates (@mo-omar-0197)
📚 Related Issues & PRs
- [#198] — Remove vulnerable xlsx (SheetJS) dependency (@intergrado-cg report, @Olow304 fix)
- [#193] — Chinese README translation (@nightire, merged by @sharafdin)
🙏 Contributors
Thank you to all contributors who made this release possible:
- @Olow304 — Structured XLSX pipeline, xlsx vulnerability fix, CLI deploy fix, clippy/lint cleanup
- @nightire — Chinese (Simplified) README translation
- @sharafdin — PR review and merge
- @mo-omar-0197 — README updates
- @intergrado-cg — Reported xlsx security vulnerability (#198)