**CODE MOVED TO GITHUB: https://github.com/bitextor** Bitextor is an application created to generate translation memories using multilingual websites as a corpus source. It downloads an entire website and applies a set of heuristics (based mainly on HTML tag structure and text block length) to find bitexts.
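As a rough illustration of what a heuristic based on HTML tag structure and text block length could look like, the sketch below scores two pages as candidate translations of each other. It is not Bitextor's actual algorithm: the names `profile` and `similarity` and the way the two signals are combined are assumptions made for this example, using only the Python standard library.

```python
from difflib import SequenceMatcher
from html.parser import HTMLParser


class _Profile(HTMLParser):
    """Collects the sequence of opening tags and the total text length."""

    def __init__(self):
        super().__init__()
        self.tags = []
        self.text_length = 0

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)

    def handle_data(self, data):
        self.text_length += len(data.strip())


def profile(html):
    """Return (list of opening tags, number of text characters) for a page."""
    parser = _Profile()
    parser.feed(html)
    parser.close()
    return parser.tags, parser.text_length


def similarity(html_a, html_b):
    """Hypothetical score in [0, 1]: tag-sequence ratio times text-length ratio."""
    tags_a, len_a = profile(html_a)
    tags_b, len_b = profile(html_b)
    structure = SequenceMatcher(None, tags_a, tags_b).ratio()
    longest = max(len_a, len_b)
    length = min(len_a, len_b) / longest if longest else 1.0
    return structure * length
```

Two pages with the same tag skeleton and similar amounts of text score close to 1, while structurally unrelated pages score close to 0. A real system would need per-block comparison and a tuned threshold.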
Dynamically loaded extension libraries for GNU AWK
The gawkextlib project provides several extension libraries for gawk (GNU AWK), as well as libgawkextlib containing some APIs that are useful for building gawk extension libraries. These libraries enable gawk to process XML data, interact with a PostgreSQL database, use the GD graphics library, and perform unlimited precision MPFR calculations. These extensions work with GNU AWK version 4.1.1 or later. We have created a framework for packaging gawk extensions, and we welcome further contributions. Recent additions include haru, redis, and select for I/O multiplexing.
Flatten XML into CSV to suit your mood
Java XML to CSV (XML2CSV) generic conversion facility. Flattens one or more similar XML files into CSV projections. I made it to extract data from big XML files and gather them in files that open more easily in a spreadsheet, because I couldn't find anything on the Internet that suited my needs at the time (Java + truly generic + self-contained algorithm + Unix-like command line options + efficiency). It is packaged as a self-executable JAR for convenient command line execution, but it can also be called directly from a Java class as part of broader [yet non-commercial] software. It handles attributes, repeated elements, and so on, and produces results on par with what spreadsheets generate when they import native XML (at least in its most extensive execution mode). Please refer to the documentation for further details (PDF doc, Open Office Writer doc, and API doc). This free software is released under the GNU General Public License Version 3.
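The general idea of flattening XML records into CSV columns can be sketched as follows. This is a minimal standard-library illustration, not XML2CSV's algorithm or API: the functions `flatten` and `xml_to_csv`, the dotted column naming, and the `record_tag` parameter are all assumptions made for the example. Unlike XML2CSV, this sketch does not handle repeated elements; a repeated child simply overwrites the earlier value.

```python
import csv
import io
import xml.etree.ElementTree as ET


def flatten(element, prefix=""):
    """Map dotted element paths (and path@attribute) to their text values."""
    row = {}
    path = f"{prefix}.{element.tag}" if prefix else element.tag
    for name, value in element.attrib.items():
        row[f"{path}@{name}"] = value
    text = (element.text or "").strip()
    if text:
        row[path] = text
    for child in element:
        # Repeated children share a path, so later ones overwrite earlier ones.
        row.update(flatten(child, path))
    return row


def xml_to_csv(xml_text, record_tag):
    """Emit one CSV row per <record_tag> element found in xml_text."""
    root = ET.fromstring(xml_text)
    rows = [flatten(record) for record in root.iter(record_tag)]
    columns = []
    for row in rows:
        for key in row:
            if key not in columns:
                columns.append(key)  # keep first-seen column order
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=columns, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)  # missing fields are written as empty cells
    return out.getvalue()
```

Records that lack a field present in other records get an empty cell, which is the simplest way to keep every row aligned to the same header.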
libwbxml is a library to encode and decode WBXML (WAP Binary XML).
DECOR and ART
ART-DECOR is an open-source tool suite that supports the creation and maintenance of HL7 Templates, Value Sets as well as Data Sets and features cloud-based federated Building Block Repositories (BBR) for Templates and Value Sets. The tool offers a Data Set and a Scenario editor, two Template editors, a Value Set editor and includes browsers for various international terminologies such as LOINC. It supports comprehensive collaboration of team members within and between governance groups. For an overview see also ART-DECOR: An Open-Source Tool Bridging the Chasm Between Clinicians and Health IT, HL7 News, September 2014. ART-DECOR is used in over 30 projects throughout Europe and other parts of the world, e.g. the national infrastructure ELGA in Austria, the Dutch Nictiz (National Healthcare Standards Institute), the RIVM (National Institute of Public Health and the Environment in the Netherlands), HL7 and IHE Germany.
Minimalistic address book in web browser. No server or plugin needed.
Minimalistic but full-featured address book in your web browser. adx is a standalone and portable web app (online and offline). FEATURES: contact management, portable, small (~200KB), lightweight, contact tagging, geo mapping, web accounts, trigger phone/Skype calls, etc. EXPORT FUNCTIONALITY: vCard (as file or QR code via offline generator), embedded Microformats (hCard 1.0, XFN). HOW IT WORKS: your address book (XML file) is transformed in your web browser (via XSLT) to a full-featured web application (HTML). REQUIREMENTS: a web browser for viewing online or offline (some, like Chrome, need a small command line parameter for offline viewing); no server, plugin or anything else needed. Any text editor can be used for contact editing (addressbook.xml).
XML Schema for questionnaires and PDF questionnaire generator
queXML is a simple XML schema for designing questionnaires. Included are stylesheets to administer the questionnaire in PDF (paper), CASES and LimeSurvey. queXML is compatible with the DDI standard.
The HandCoded Toolkit for FpML processing is a library supporting functions for manipulating FpML documents implemented both in Java and C#.
Simulatoralive's Java libraries and programs
This is the home of my current Java projects, including a few libraries and programs for various purposes.
The Digital Preservation Recorder (DPR) has been developed by the National Archives of Australia to manage a digital preservation workflow. It features antivirus integration and makes use of the Xena framework for preservation conversions of data objects.
Java based XSLT Processor extension for syntax highlighting
This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon). If you have, for example, an article about programming written in DocBook, its code examples can be syntax highlighted automatically during the XSLT processing phase.
EasyML serialization library, to and from XML, similar to Gson
EasyML converts Java objects into XML and back again, without the need for annotations or other types of configuration. EasyML offers extensive support for JDK classes and also supports customization through user settings, user extensions, or through the Java Serialization API. EasyML provides support for: reading from and writing to XML text; reading from and writing to org.w3c.dom documents; the Java Collections framework; the Java Serialization framework; multi-threading; Java Generics. EasyML can be customized with user-defined serialization strategies. The low-level components, XMLWriter and XMLReader, can be used directly for finer control than the EasyML Facade offers. Security policies can also be defined, specifying blacklists or whitelists of types that are allowed at deserialization time. EasyML on GitHub: http://github.com/cordisvictor/easyml-lib
Simple C++ XML processing
Simple XML processing using C++14: header-only, fully portable, simple API, powerful features, high performance. Overview: http://zenxml.sourceforge.net
Cross-platform visual XSLT generator
A library of utility classes that simplify working with the XML APIs provided by the JDK. These have largely been developed to meet specific needs in the maintainers' other professional and personal projects. Requires JDK 1.5 or later. Available from Maven Central: <groupId>net.sf.practicalxml</groupId> <artifactId>practicalxml</artifactId>
Database replication tool based on XML Schema
xsd2pgschema is a Java application suite that converts XML Schema 1.1 to PostgreSQL DDL and supports XML data migration into PostgreSQL based on the XML Schema without loss of information content. It also supports full text search via either Apache Lucene or Sphinx Search. File conversion from XML to CSV or JSON is available, as well as mapping XML Schema to JSON Schema. The resulting PostgreSQL database can be optimized at the user's discretion. Moreover, XPath/SQL translation, direct XPath addressing and document order preservation are available as options. Very large XML files can be split and processed through xmlsplitter, a flexible XML splitter based on XPath and StAX.
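The idea of splitting a large XML file with a streaming parser can be sketched as below. This is not xmlsplitter itself, which is a Java tool based on XPath and StAX. The example swaps in Python's `xml.etree.ElementTree.iterparse` as the streaming parser and selects records by a plain tag name instead of an XPath expression. The function name `split_records` and its parameters are hypothetical.

```python
import io
import xml.etree.ElementTree as ET


def split_records(source, record_tag, chunk_size):
    """Yield lists of serialized <record_tag> elements, chunk_size at a time.

    `source` is a filename or a binary file object. Elements are cleared
    after serialization so their contents do not accumulate in memory.
    """
    chunk = []
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == record_tag:
            elem.tail = None  # drop whitespace that follows the closing tag
            chunk.append(ET.tostring(elem, encoding="unicode"))
            elem.clear()
            if len(chunk) == chunk_size:
                yield chunk
                chunk = []
    if chunk:
        yield chunk  # final, possibly shorter, chunk
```

Each yielded chunk could then be wrapped in a root element and written to its own file, so that downstream processing never has to load the full document.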
C++ library for working with OWL ontologies
XSH is a powerful command-line XML editing tool and programming language in the manner of Unix shell interpreters and line-oriented text editors like ed. It can be used either interactively or for batch-mode XML processing.
An XML parser written in the REXX programming language. It runs on mainframes (z/OS Rexx) as well as Windows and Linux (Regina or ooRexx). Includes example Rexx programs that use the parser such as: JCL2XML (converts z/OS JCL into an XML format), AUX2SVG (converts a z/OS CICS auxiliary trace file into a visual SVG format), PRETTY (an XML pretty printer), DEVISIO (an example of removing unwanted tags from an SVG file).
Lightweight runtime monitor for AJAX web applications that checks in real time whether XML messages received and sent by the application satisfy a predefined interface specification. Complex message sequences and data values are supported.
SEPA support tools and library
Full set of programs to create, edit and view SEPA documents.
XML-Parse library is a lightweight set of functions for parsing, checking, and creating xml files. It can support stream-oriented, SAX or DOM parsing styles, and includes an optional xsd schema validator and graphical schema generator.
GPSdings (project name GPStools) is a set of free applications that let you manipulate and analyse GPS data from the command line.
Convert Cobol Data Files to/From Xml
This project converts Cobol data files to and from XML files using a Cobol copybook. It provides both a batch interface and a Java/JVM interface for conversion.
Vendor-independent data interface for public transport (ÖPV) sales systems
Vendor-independent data interface for exchanging fare data and result data in support of sales applications in public transport, ÖPV/ÖPNV (ticket sales, eTicketing, EBE for bus, rail, etc.).