Build Vision Agents quickly with any model or video provider
All Algorithms implemented in Python
Framework for building neural networks
Unified Multimodal Understanding and Generation Models
World of apps for benchmarking interactive coding agent
PDF to Markdown with vision models
The Multi-Agent Framework
Motion-controllable Video Generation via Latent Trajectory Guidance
Unified Model Serving Framework
Build cross-modal and multimodal applications on the cloud
A Universal Customization Method for Single and Multi Conditioning
A lightweight vision library for performing large object detection
TensorFlow is an open source library for machine learning
Haystack is an open source NLP framework to interact with your data
Large-language-model & vision-language-model based on Linear Attention
AI-data warehouse to enrich, transform and analyze unstructured data
Segmentation models with pretrained backbones. PyTorch
Simple and rapid application development framework
Ongoing research training transformer models at scale
code for Mesh R-CNN, ICCV 2019
An end-to-end Data Scientist
PaddlePaddle End-to-End Development Toolkit
Natural language workflows for AI agents
Harmonized and Coherent Human Image Animation