Kubeflow Training Operator is a Kubernetes-native project for fine-tuning and scalable distributed training of machine learning (ML) models created with various ML frameworks such as PyTorch, TensorFlow, XGBoost, MPI, Paddle, and others.
Features
- TensorFlow Release Only
- Python SDK for Kubeflow Training Operator
- Documentation available
- Examples available
- TensorFlow API Definition
- Use Kubernetes workloads to effectively train your large models via Kubernetes Custom Resources APIs
- Use Training Operator Python SDK
Categories
Machine LearningLicense
Apache License V2.0Follow Kubeflow Training Operator
Other Useful Business Software
Add Two Lines of Code. Get Full APM.
Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Kubeflow Training Operator!