Conduit streams data between data stores. Kafka Connect replacement
Code review for data in dbt
Pythonic tool for running machine-learning/high-performance workflows
Open-source data observability for analytics engineers
BitSail is a distributed, high-performance data integration engine
Next-Generation Event Processing Platform
SeaTunnel is a distributed, high-performance data integration platform
Haskell pretty printer
lakeFS - Git-like capabilities for your object storage
Open source framework for deep learning satellite and aerial imagery
Unified Model Serving Framework
AutoGluon: AutoML for Image, Text, and Tabular Data
StarRocks is a next-gen sub-second MPP database for full analytics
A framework for building messaging apps with .NET and C#
End-to-end typesafe APIs made easy
Always know what to expect from your data
Redis-based components for Scrapy
Stanford NLP Python library for many human languages
Concourse is a container-based continuous thing-doer written in Go
Gitness is an Open Source developer platform with Source Control
Contextually-keyed word vectors
Elyra extends JupyterLab with an AI-centric approach
Kubernetes-native platform to run massively parallel data/streaming
The One CD for All {applications, platforms, operations}
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle