A high-performance real-time analytics database
Apache Impala
Data Version Control | Git for Data & Models
Apache Drill is a distributed MPP query layer for self-describing data
Cloud-native file store
Get random, realtime read/write access to your Big Data
JuiceFS is a distributed POSIX file system built on top of Redis
Upserts, Deletes And Incremental Processing on Big Data
Python module that helps you build complex pipelines of batch jobs
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters
A flexible and efficient library for deep learning
Read COBOL data files in Java
Active, high-performance open source database middleware
Hadoop spliced read aligner for RNA-seq data
Hadoop MapReduce Maven plugin
Distributed RDF Processing over Hadoop
File transfer from local FS to HDFS
Workflow Designer, Hive Editor, Pig Editor, File System Browser
This project aims to provide P2P capabilities with Hadoop DFS.
A Dynamic Slot Allocation and Scheduling System for MapReduce Clusters
Integration of system status data based on HDFS
Console file manager for Hadoop, written in Java.