Pretrained time-series foundation model developed by Google Research
Natural language workflows for AI agents
Follow along with my AI Agents Masterclass videos
Run all your local AI together in one package
Document Image Parsing via Heterogeneous Anchor Prompting”
Omnilingual ASR Open-Source Multilingual SpeechRecognition
A step-by-step guide to build your own AI agent
An AI-powered file management tool that ensures privacy
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Generate blog articles from video or audio
Collections of robotics environments
DeepMind model for tracking arbitrary points across videos & robotics
MCP integration platforms for AI agents to use tools at any scale
Constrained Value Alignment via Safe Reinforcement Learning
An AI for Music Generation
Request recommended movies, TV shows and anime to Jellyseer/Overseer
Build Vision Agents quickly with any model or video provider
Plug-and-play library to enable agents to call MCP and UTCP tools
Datasets, transforms and models specific to Computer Vision
AI agents running research on single-GPU nanochat training
AI agents running research on single-GPU nanochat training
Run AI models end-to-end encrypted
Towards Human-Sounding Speech
OCR expert VLM powered by Hunyuan's native multimodal architecture
Benchmarking synthetic data generation methods