OCR expert VLM powered by Hunyuan's native multimodal architecture
The repository provides code for running inference with SAM 2
Working with all kinds of unstructured data, with use cases such as reverse image search
Pre-trained Deep Learning models and demos
Git-based data version control for machine learning workflows
Your Personal Research Multi-Tool
Recovering the Visual Space from Any Views
Scalable machine learning for time series forecasting
PPTAgent: Generating and Evaluating Presentations
Simple, unified interface to multiple Generative AI providers
Tools for merging pretrained large language models
A lightweight framework for building LLM-based agents
Helping you get the most out of AWS, wherever you use MCP
Hub of ready-to-use datasets for ML models
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Repo of Qwen2-Audio chat & pretrained large audio language model
The official PyTorch implementation of Google's Gemma models
Tutorial tailored for Chinese-speaking beginners on rapid fine-tuning
Solve end-to-end problems using the Llama model family
Generate high-definition short story videos with one click using AI
Implementation of 'lightweight' GAN, proposed at ICLR 2021
Framework to easily create LLM-powered bots over any dataset
Powering Amazon custom machine learning chips
GEO-first SEO skill for Claude Code
Give your AI agent eyes to see the entire internet