DocWire SDK News

Award-winning modern data processing SDK in C++20

Brought to you by: novitzmann, silvercoders

DocWire SDK / News: Recent posts

Thank You SoureForge for Rising Star award

We’re happy to share that DocWire SDK has received the Rising Star Award here on SourceForge.

It’s a next milestone for us — a sign that the project is gaining real momentum and solving problems for developers working with complex documents and data pipelines in C++.

DocWire SDK is a modern data processing toolkit built for performance, portability, and clean C++ design. It’s already being used in production by companies in AI, digital forensics, and consulting — and we’re looking to grow further.... read more

Posted by 2025-07-10

DocWire SDK – A Journey of Innovation in Data Extraction 2024 - 2025

Empowering C++ Developers with Cutting-Edge Data Processing

Over the past year, DocWire SDK has rapidly evolved, bringing powerful data extraction, parsing, and content processing capabilities to C++ developers worldwide. From foundational improvements in performance and stability to advanced AI-driven text analysis, our SDK has grown into an essential tool for anyone dealing with structured and unstructured data. ... read more

Posted by 2025-02-19 Labels: #C++20SDK #ContentParsing #OCR #Data-Extraction #AI-Integration #NLP-Integration #High-Performance-Parsing #Modular-API #Document-Processing #Content-Parsing #CPP20 #C++

Latest and the greatest

We’ve been busy making things faster, smoother, and all-around better for you. Here’s the lowdown:

Big Overhaul & Performance Boosts:

Speed Enhancements: New caching and memory management tricks mean everything runs a lot faster.
PST Parser Fix: Fixed a bug that was limiting mail processing – no more data loss!
Exporters Enhanced: Added metadata support in HTML exporters and fixed issues in the EML parser for more accurate exports.
Modern C++ Practices: We’ve upgraded to use move semantics and smart pointers, making our code safer and faster. No more unnecessary copying – it’s lean and mean now!
Simplified Code: We cleaned house by getting rid of outdated stuff like ParserWrapper and wrapper_parser_creator. Now, it’s much easier to understand and work with.
Better Parsing: We moved parsing duties from individual parsers to the Importer class. This makes everything more streamlined and independent.
PDFParser Upgrade: We swapped out std::mutex lock() and unlock() with std::lock_guard for simpler and safer code.
Bye-Bye Old Code: Out with the old! We replaced the FormattingStyle class and std::bind with shiny new lambda expressions.
Performance Comparison Tool: New script to compare SDK performance – great for making sure we’re always improving.... read more

Posted by 2024-07-25 Labels: c++20 C++SDK C++20SDK PDFParser DataExtraction DataProcessing C++20Library

DocToText Data Extraction SDK 5.0.9

Introducing the New Version of DocToText 5.0.9 SDK: Enhanced Features for Effortless Data Processing

We are thrilled to announce the release of the latest version of DocWire’s data extraction SDK. This version is packed with lots of powerful features that will help in streamlining the process of extracting, importing, and exporting various data types. Let’s dive into the latest features that make this version a game-changer in the field of data extraction. ... read more

Posted by 2023-07-06

Rebranding

NEWS RELEASE

** Silvercoders Announce Rebranding to Docwire**

The time-saving backbone of document processing, we provide the tools an organization needs in order to extract any data type of data

Silvercoders, a company that has provided data extraction tools and services for years, has announced today that it has completed a significant rebranding. The step was taken by the CEO and the board in order to reflect the new mission and vision of the company, namely: helping businesses to effectivise their operations by providing fast & dynamic document processing solutions.... read more

Posted by 2023-01-15

DocToText version 4.0 was released today.

DocToText version 4.0 was oficially released today. After introducing PDF, iWork, XLSB, OpenDocument Flat XML and EML (email) this version of utility supports all important document formats on the market. Support for Object Linking and Embedding (OLE) in ODF formats added. Win64 is officially supported since this version. Capabilities of C API has been expanded significally. A lot of fixes and improvements, including improvements for multithreaded applications.

Posted by 2014-01-07

DocToText version 0.13.0 released today.

DocToText version 0.13.0 was oficially released today. This is the first version available for Mac OS X and also the first version available as a C/C++ library in addition to the console application. MS PowerPoint binary format (PPT) support was added. Headers, footers and embedded XLS workbooks in DOC files are supported. Extracting text from OpenDocument and OOXML formats was significally optimized. In addition a lot of bugs were fixed in this version.

Posted by 2012-10-19