OGSA-DAI is a product that allows data resources, such as file collections, relational or XML databases, to be accessed, integrated and federated across the Internet.
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor.
1. Easy Tool for big data
2. Use comfortable in Hadoop EcoSystem projects
3. Based GPL V3 License
Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy...
Open source Extract Transform Load engine written in Java
ETL Framework is a standalone Extract Transform Load engine written in Java. It includes executables for all major platforms and can be easily integrated into other applications.
Key Features:
* embeddable, open source and free
* fast and scalable
* uses target database features to do transformations and loads
* manual and automatic data mapping
* data streaming
* bulk data loads
* data quality features using SQL, JavaScript? and regex
* data transformations
Requirements
*...
Apatar is an open source Extract, Transform, and Load (ETL) project. Modular architecture delivers 1. Visual job designer/mapping 2. Connectivity to all major data sources 3. Flexible Deployment Options (GUI, or server engine with JVM, or embedded).
This project is to develop web applications and data integration functions to provide information on the collection records, ecology, geographic distribution, and taxonomic concepts of the vascular flora of the region.
SPIDR (Space Physics Interactive Data Resource) is a distributed database and application server network, built to select, visualize and model historical space weather data. SPIDR is a web-application and a grid of data mining web-services.
AppSignal starts at $23/month with all features included. No overages, no hidden fees. 30-day free trial.
Tired of monitoring tools that punish you for scaling? AppSignal offers transparent, predictable pricing with every feature unlocked on every plan. Track errors, monitor performance, detect anomalies, and manage logs across Ruby, Python, Node.js, and more. Trusted by developers since 2012 with free dev-to-dev support. No credit card required to start your 30-day trial.
AMB New Generation Data Empowerment - offers a comprehensive approach to data governance needs with ground breaking features to locate, identify, discover, manage and protect your overall data infrastructure. Repeatable Process/Exposed Repository.
Salesforce Integration, legacy data migration and ETL Tool. Create and sync an exact replica of your Salesforce data to a local data store, migrate legacy data to Salesforce. Available for MySQL, Oracle, SQLServer. http://www.openforcesi.org/
Trauma registry suite; Data collection application and server scripts to build trauma data warehouse and perform web-based analysis reporting. Cross-platform compatible for Windows, Apple, Unix, or Linux.
FreeAnalysis is a complete java (Eclipse RCP) and Web 2.0 (Dojo) application that provide Olap functions against Pentaho Mondrian Olap Server and other MDX/XMLA compliant cubes datasources such as Microsoft Analysis or Hyperion.
Set of tools and libs for
managing structured data
in a very flexible way:
Imp./Exp. ASCII, XML, SQL,
PS, Tex/LaTex, RTF
GUI: X-Windows, MS-Windows
Interface to C++, DBs, Perl,
PHP, Java, TCP/IP
LISP-like interpreter
written in C++ using C-LIB
Object Relational Persistence and Object-Based SQL Query API. Similar to that of Hibernate or JDO but much better structured design. This API has several layers of abstraction to be useful for nearly any situation.
JGV is a Java applet that lets you embed movable 3D objects in web pages. Users can rotate, scale, translate, select and annotate the objects interactively. Also, JGV supports XML-based transmission of user events to the source web server over HTTP.
Visualization System is a server application for presenting and manipulating XML-based adapted multimedia data, useful eg. in mobile environement. It is component based framework for additional modules developement.
JWDE extracts product information set from web and dumps is into db so e-commerce packages can use it. Currently JWDE can extract information from saved HTML files and convert it into JWDE XML format which can be save to osCMax e-commerce product.