Pachyderm

Data-driven pipelines automatically trigger based on detecting data changes. Automatic immutable data lineage and data versioning of all data types. Autoscaling and parallel processing built on Kubernetes for resource orchestration. Uses standard object stores for data storage with automatic deduplication. Runs across all major cloud providers and on-premises installations. Automatic and intelligent versioning of even the largest data sets of unstructured and structured data. Git-like structure enables effective team collaboration. Full versioning for metadata including all analysis, parameters, artifacts, models, and intermediate results. Automatically produces an immutable record for all activities and assets. Pachyderm is used across a variety of industries and use cases. Pachyderm provides a powerful solution to optimize data processing, MLOps, and ML Lifecycles.

Features

Automate Complex Pipelines
Automatically produces an immutable record for all activities and assets
Runs across all major cloud providers and on-premises installations
Autoscaling and parallel processing built on Kubernetes for resource orchestration
Natural Language
Optimize data processing, MLOps, and ML Lifecycles

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Pachyderm

Pachyderm Web Site

User Reviews

Be the first to post a review of Pachyderm!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Related Categories

Go Frameworks

Registered

2023-01-06

Similar Business Software

CubicWeb

Modeling your data is the first step, as it always should be because applications fade away but data is here to stay. Once your model is implemented, your CubicWeb application runs and you can incrementally add high-value functionalities for your users. Based on the application model, RQL is a...

See Software
Horovod

Horovod was originally developed by Uber to make distributed deep learning fast and easy to use, bringing model training time down from days and weeks to hours and minutes. With Horovod, an existing training script can be scaled up to run on hundreds of GPUs in just a few lines of Python code....

See Software
ADO.NET Data Providers

dotConnect is an enhanced data connectivity solution built over ADO.NET architecture and a development framework with a number of innovative technologies. dotConnect includes high-performance data providers for the major databases and popular cloud applications and offers a complete solution for...

See Software

Report inappropriate content

Pachyderm

Data-Centric Pipelines and Data Versioning

Features

Project Samples

Project Activity

Categories

License

Follow Pachyderm

User Reviews

Additional Project Details

Operating Systems

Programming Language

Related Categories

Registered