PDF transformation/manipulation program + library
QPDF is a C++ library and set of programs that inspect and manipulate the structure of PDF files. It can encrypt and linearize files, expose the internals of a PDF file, and do many other operations useful to end users and PDF developers.
Android adb backup extractor and creator
A Simple CSV Parser for Java under a commercial-friendly Apache 2.0 license
Free open source ETL software for data integration anywhere.
Expand your open source stack with a free open source ETL tool for data integration and data transformation anywhere. Work with the latest cloud applications and platforms or traditional databases and applications using Open Studio for Data Integration to design and deploy quickly with graphical tools, native code generation, and 100s of pre-built components and connectors. Open Studio for Data Integration is fully open source, so you can see the code and work with it. Embed existing Java code libraries, create your own components or leverage community components and code to extend your project. Millions of downloads and a full range of robust, open source integration software tools have made Talend the open source leader in cloud and big data integration.
Fedora Commons provides open-source software to ensure the durability and integrity of digital content, to use semantics to contextualize and inter-relate content from many sources, and to enable the creation of innovative, collaborative information spaces.
Cross-platform DICOM implementation
Grassroots DiCoM is a C++ library for DICOM medical files. It is accessible from Python, C#, Java and PHP. It supports RAW, JPEG, JPEG 2000, JPEG-LS, RLE and deflated transfer syntaxes. It comes with a scanner implementation that can quickly scan hundreds of DICOM files. It supports SCU network operations (C-ECHO, C-FIND, C-STORE, C-MOVE). PS 3.3 & 3.6 are distributed as XML files. It also provides PS 3.15 certificate- and password-based mechanisms to anonymize and de-identify DICOM datasets.
The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then outputs that metadata in XML formats. It can be used through a graphical user interface or a command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on SourceForge will no longer be updated, as the project has moved to GitHub.
Project moved to GitHub
Joda-Time provides a library of classes to replace the Java JDK Date and Time classes including formatting. It is based around the ISO8601 datetime standard, but also provides full support for other calendar systems, such as Gregorian and Buddhist. Project moved to GitHub: https://github.com/JodaOrg/joda-time
PyXB (“pixbee”) is a pure Python package that generates Python source code for classes that correspond to data structures defined by XMLSchema. In concept it is similar to JAXB for Java and CodeSynthesis XSD for C++.
A set of tools for working with high-throughput sequencing data
A set of tools (in Java) for working with next generation sequencing data in the SAM/BAM format. Note that development has moved to GitHub at https://github.com/broadinstitute/picard and support is available on the GATK forum at http://gatkforums.broadinstitute.org/categories/ask-the-team
The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See http://dita-ot.sourceforge.net/ for documentation and links to downloads. The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
Simple Java API for Windows-style .ini file handling. It also provides Java Preferences API functionality on top of .ini files.
Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Kabeja is a Java library for parsing DXF and converting it to SVG (dxf2svg). The library supports the SAX API and can be integrated into other applications (Cocoon, Batik). Tools for converting SVG to JPEG, TIFF, PNG and PDF are included.
A software toolkit for using the EPCglobal Low Level Reader Protocol (LLRP) to communicate with conforming RFID readers.
EXI implementations in Java and C#
Open source .NET (C#) / Java implementation of the W3C Efficient XML Interchange (EXI) format specification. As a corollary to XML, EXI is an alternative, very efficient format that has all of the mechanics of XML, but is much more compact and is faster to exchange. - README (about the Nagasena EXI implementation) https://www.dropbox.com/s/adh83u9z1x1czv6/README.txt?dl=0 - Nagasena EXI grammar interchange format (PDF) https://www.dropbox.com/s/etrpuchaddplq2s/EXIGram.pdf?dl=0 - Introduction to OpenEXI (PDF) https://www.dropbox.com/s/n2545xm0jjyui2d/IntroToOpenEXI.pdf?dl=0
An XQuery and XPath 2.0 library, written in C++ and built on top of Xerces-C.
eLML (eLesson Markup Language) is an XML framework for creating structured eLessons based on a pedagogical model. eLML consists of an XMLSchema and XSLT files to create XHTML, PDF, LaTeX, IMS CP and SCORM versions, standards supported by most LMS.
Converts DBF files to MySQL, CSV, or XML. Русский, English, Deutsch, Español, Italiano, Français, Čeština, Română. http://dbf-to-mysql-csv-xml.sourceforge.net
JSefa is a simple library for stream-based serialization of Java objects to XML, CSV, or FLR and back again using an iterator-style interface independent of the serialization format. A standard edition and an Android edition are available.
The 4Ever framework provides an OO access layer to a single XML file or a set of XML files. The framework offers a set of basic management services such as loading/saving, change history, and consistency checking. It is realized in a component-oriented way.
This program converts HTML pages into LaTeX format. Custom mappings between HTML tags and character entities can be defined. CSS formatting properties are also supported (including colours). Implemented in Java.
Talend Spatial Module (aka Spatial Data Integrator or SDI) is an ETL tool for geospatial data. Based on Talend Open Studio, it provides input, output and transform geocomponents. IO components read/write GIS formats (e.g. PostGIS, GeoRSS). Transformers all
Low level, "surgical strike" PDF toolkit Complete low level PDF acces
Cross-platform, low-level, "surgical strike" PDF toolkit. Complete low-level PDF access: no limit to what you can do. Create, edit, optimize, import, export, and fully deconstruct PDFs. Simple low-level API to PDF objects. You build the SDK from scratch using 100% open source libs. Our ÜberBuild™ utility can build it for you. A serious tool for serious coders.
The VCS to ICS Calendar Converter allows you to convert VCALENDAR files (for example, those generated by Nokia nbuexplorer or Microsoft Works) to the newer iCalendar format, used by Android and iOS, for example. It supports parsing events and to-dos.