Download, save and convert multiple subtitles from YouTube videos
Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.
Editor for simple (only one record type) Csv Files.
Editor for simple (only one record type) Csv Files and Viewer / Updater for Xml files.
ProM is the comprehensive, extensible framework for process mining. Process Mining deals with the a-posteriori analysis of (business) processes using enactment logs.
Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
EXI implementations in Java and C#
Open source .Net (C#) / Java implementation of the W3C Efficient XML Interchange (EXI) format specification. As a corollary to XML, EXI is an alternative, very efficient format that has all of the mechanics of XML, but is much more compact and is faster to exchange. - README (about Nagasena EXI implemenation) https://www.dropbox.com/s/adh83u9z1x1czv6/README.txt?dl=0 - Nagasena EXI grammar interchange format (PDF) https://www.dropbox.com/s/etrpuchaddplq2s/EXIGram.pdf?dl=0 - Introduction to OpenEXI (PDF) https://www.dropbox.com/s/n2545xm0jjyui2d/IntroToOpenEXI.pdf?dl=0
Simple tool for converting large XML-files to JSON or JSON to XML
Simple converter tool with GUI (written on JavaFX) for converting large XML-files to JSON and JSON to XML with indicating progress and uses small amount of memory for converting. Starting from 1.2.0 application supports batch converting files from directory by pattern. Uses Java 1.8+ (http://www.oracle.com/technetwork/java/javase/downloads/jre8-downloads-2133155.html). Distributions for Mac, Linux and Windows already have embedded JRE, so just download appropriate distribution and start application.
Flatten XML into CSV to suit your mood
Java XML to CSV (XML2CSV) generic conversion facility. Flattens one or more similar XML files into CSV projections. I made it in order to extract data from big XML files and gather them in files more easily opened with a spreadsheet because I didn't find anything adapted to my needs over the Internet when I needed to (Java + truly generic + self-contained algorithm + Unix like command line options + efficiency). It is packaged as an auto executable Jar for convenient command line execution but might as well be interfaced directly by a Java class as part of a broader [yet non commercial] software. It handles attributes, repeated elements, and so on, and produces results which level up with what spreadsheets generate when they import native XML (at least in its most extensive execution mode). Please refer to the documentation for further details (PDF doc, Open Office Writer doc, and API doc). This free software is released under the GNU GENERAL PUBLIC LICENSE Version 3.
VTD-XML is the next generation XML parser/indexer/editor/slicer/assembler/xpath-engine that goes beyond DOM, SAX and PULL in performance, memory usage, and ease of use.
The 4Ever framework provides a OO access layer to a single or a set of XML files. The framework offers a set of basic management services as loading/saving, change history, and consistency checking. It is realized in a component-oriented way.
GPSdings (project name GPStools) is a set of free applications that let you manipulate and analyse GPS data from the command line.
Cross-platform visual XSLT generator
Database replication tool based on XML Schema
xsd2pgschema is a Java application suite, which converts XML Schema 1.1 to PostgreSQL DDL and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full text search via either Apache Lucene or Sphinx Search. File convesion from XML to CSV, TSV, or JSON is available as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL database can be optimized at user's discletion. Moreover, XPath/SQL translation, direct XPath addressing and document order preservation are possible by option. Quite large XML file can be split and processed through xmlsplitter, a flexible XML splitter based on XPath and StAX.
Lightweight runtime monitor for AJAX web applications that checks in real time whether XML messages received and sent by the application satisfy a predefined interface specification. Complex message sequences and data values are supported.
Hypermachiavel is a software developped in Java, addressed to end-users such as linguists and humanities researchers, offering various manipulations on an aligned corpus of texts.
The Digital Preservation Recorder (DPR) has been developed by the National Archives of Australia to manage a digital preservation workflow. It features antivirus integration and makes use of the Xena framework for preservation conversions of data objects
Convert Cobol Data Files to/From Xml
This project will convert Cobol Data Files to/From Xml files using a Cobol Copybook. It provides both a Batch and Java/JVM for conversion
Java based XSLT Processor extension for syntax highlighting
This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon), so if you have e.g. article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
A library of utility classes that simplify working with the XML APIs provided by the JDK. These have largely been developed to meet specific needs in the maintainers' other professional and personal projects. Requires JDK 1.5 or later. Available from Maven Central: <groupId>net.sf.practicalxml</groupId> <artifactId>practicalxml</artifactId>
Framework for search and display of heterogenous document collections.
The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
====== OIL (Object Introspection Library) for C++ ====== Aims at Facilitating the Object Introspection Capabilities for C++ Developers. The Goals is to be able to display and modify the contents of any given C++ Object at run time.
Offline Wily Transaction Inspector 0.16.7.3 BETA
Offline Wily Transaction Inspector is a Java standalone application with which one can analyze trace files offline generated by Wily Introscope Workstation or Webview.
The Accelerator Markup Language (AML) / Universal Accelerator Parser (UAP) project will develop an XML based format for describing high energy particle accelerators along with associated software to convert lattice files to a standard internal struct
EasyML serialization library, to and from XML, similar to Gson
EasyML converts Java objects into XML and back again, without the need for annotations or other types of configuration. EasyML offers extensive support for JDK classes and also supports customization through user settings, user extensions, or through the Java Serialization API. EasyML provides support for: - reading from and writing to XML text - reading from and writing to org.w3c.dom documents - Java Collections framework - Java Serialization framework - Multi-threading - Java Generics EasyML can be customized with user-defined serialization strategies. The low-level components, XMLWriter and XMLReader, can be used directly, for a higher control compared to the EasyML Facade. Security policies can also be defined, specifying black- or whitelists of types which are allowed at deserialization time. EasyML on GitHub: http://github.com/cordisvictor/easyml-lib
XML Application Framework based on JAVA
JART is an XML application framework based on JAVA, using an internal XML programming language. The main goal of the project was to create an high flexible 3 tier based framework for rapid development of web based applications.
File validation and characterization
JHOVE provides functions to perform format-specific identification, validation, and characterization of digital objects. JHOVE should not be confused with JHOVE2, a product with similar aims but a completely separate code base.