PyTorch library of curated Transformer models and their components
Instant voice cloning by MIT and MyShell. Audio foundation model
An open source implementation of CLIP
Open source codebase for Scale Agentex
A coding-free framework built on PyTorch
A fast and lightweight framework for creating decentralized agents
Large Multimodal Models for Video Understanding and Editing
RL research on Android devices
Automate browser-based workflows with LLMs and Computer Vision
Django friendly finite state machine support
Jittor is a high-performance deep learning framework
Structured outputs for llms
OCR expert VLM powered by Hunyuan's native multimodal architecture
General proxy performance testing tool based on Clash using Telegram
Interpretable prompting and models for NLP
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Models and examples built with TensorFlow
Python package for AutoML on Tabular Data with Feature Engineering
Deep learning library
4M: Massively Multimodal Masked Modeling
Guiding Instruction-based Image Editing via Multimodal Large Language
Agent toolkit providing semantic retrieval and editing capabilities
PyTorch code and models for the DINOv2 self-supervised learning
The Unified Machine Learning Framework
The Operator Splitting QP Solver