There are plenty of other CSV parsers in the wild, but I had a hard time finding what I wanted. Inspired by Python's csv module, I wanted a library with simple, intuitive syntax. I also wanted support for special use cases such as calculating statistics on very large files. With the deluge of large datasets available, a performant CSV parser is a necessity. By using overlapped threads, memory-mapped IO, and efficient data structures, this parser can quickly tackle large CSV files. It also has a minimal memory footprint and can handle larger-than-RAM files.
Features
- This CSV parser is more than a fancy string splitter: it parses any file that follows RFC 4180
- This CSV parser is encoding-agnostic and handles both ANSI and UTF-8 encoded files
- By default, rows of variable length are silently ignored, although you can choose to keep them or throw an error instead
- This CSV parser has an extensive test suite and is checked for memory safety with Valgrind
- It does not try to decode UTF-8, except for detecting and stripping UTF-8 byte order marks
- This library was developed with Microsoft Visual Studio
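
Two of the features above, RFC 4180 quoting and UTF-8 byte order mark stripping, can be illustrated with a small standalone sketch. This is not the library's own code or API: `strip_utf8_bom` and `split_record` are hypothetical helper names invented for illustration, and the sketch assumes C++17 and a record that has already been read into memory as a single string.

```cpp
#include <cstddef>
#include <string>
#include <string_view>
#include <vector>

// Hypothetical helper (not part of the library): drop a leading UTF-8
// byte order mark (bytes EF BB BF) if present, otherwise return the input.
std::string_view strip_utf8_bom(std::string_view text) {
    constexpr std::string_view bom = "\xEF\xBB\xBF";
    if (text.substr(0, bom.size()) == bom) {
        text.remove_prefix(bom.size());
    }
    return text;
}

// Hypothetical helper (not part of the library): split one record into
// fields using RFC 4180 quoting. Delimiters inside double quotes are kept
// as data, and a doubled quote ("") inside a quoted field is one literal
// quote character.
std::vector<std::string> split_record(std::string_view record, char delim = ',') {
    std::vector<std::string> fields;
    std::string current;
    bool in_quotes = false;

    for (std::size_t i = 0; i < record.size(); ++i) {
        const char c = record[i];
        if (in_quotes) {
            if (c == '"') {
                if (i + 1 < record.size() && record[i + 1] == '"') {
                    current += '"';     // escaped quote inside a quoted field
                    ++i;                // skip the second quote of the pair
                } else {
                    in_quotes = false;  // closing quote
                }
            } else {
                current += c;           // delimiters and newlines are data here
            }
        } else if (c == '"') {
            in_quotes = true;           // opening quote
        } else if (c == delim) {
            fields.push_back(current);  // end of an unquoted or closed field
            current.clear();
        } else {
            current += c;
        }
    }
    fields.push_back(current);          // the last field has no trailing delimiter
    return fields;
}
```

A naive split on commas would break the record `1,"Smith, John"` into three pieces, whereas `split_record` returns the two fields `1` and `Smith, John`. Calling `strip_utf8_bom` on the first line of a file before splitting keeps the byte order mark from leaking into the first column name.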
Categories
JSON
License
MIT License