The SOTA Open-Source Browser Agent
Pre-trained Deep Learning models and demos
Repo of Qwen2-Audio chat & pretrained large audio language model
Global weather forecasting model using graph neural networks and JAX
Multilingual sentence & image embeddings with BERT
Ollama Python library
GLM-4 series: Open Multilingual Multimodal Chat LMs
Modern Flask framework optimized for AI-assisted development
Open-source MCP server that gives your coding agent
Large Language Model Principles and Practice Tutorial from Scratch
Definitions for AI/ML tasks like dataset creation
Long-form streaming TTS system for multi-speaker dialogue generation
Block Diffusion for Ultra-Fast Speculative Decoding
Implementation of "MobileCLIP" CVPR 2024
Simple, unified interface to multiple Generative AI providers
Pythonic tool for running machine-learning/high performance workflows
Official SeedVR2 Video Upscaler for ComfyUI
OCR expert VLM powered by Hunyuan's native multimodal architecture
An adaptive Web Scraping framework
Collection of cybersecurity-related references, scripts, tools, code
AI tool for detecting complex vulnerabilities in Python codebases
AI-driven neuro-symbolic solver for high-school geometry problems
LLM-based Reinforcement Learning audio edit model
Pretrained time-series foundation model developed by Google Research
Inference script for Oasis 500M