Embedded Template Library
Zero-ETL, infinite possibilities. Live query APIs, code & more
Library providing end-to-end GPU-accelerated recommender systems
Open source semantic search and text analytics for large document sets
Superduper: Integrate AI models and machine learning workflows
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
A system for agentic LLM-powered data processing and ETL
All-in-one AI framework & toolkit for Claude Code & Cursor
A unified analytics engine for large-scale data processing
Pentaho offers comprehensive data integration and analytics platform.
Text-based user interface to query data on Oracle DB in a smart way
ETL engine based on Groovy
NBi is a testing framework (add-on to NUnit)
Streaming csv parser inspired by binary-csv that aims to be faster
A Chinese information extraction tool
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Java utility that reads the metadata from table(s)
Free open source geocoder
automate Informatica control file creation
A utility that uses Informatica Operations API
PostgreSQL Bulk Data Loader