An open-source implementation of Notebook LM with more flexibility
Open-Sora: Democratizing Efficient Video Production for All
Synchronized Translation for Videos
Framework for building, orchestrating, and deploying AI agents
Python crawler for collecting and downloading Sina Weibo user data
Qwen3-Coder is the code version of Qwen3
Algorithms for outlier, adversarial and drift detection
Powerful Android AI agent with tools, automation, and Linux shell
Using AI models to automatically provide commentary and edit videos
Qwen3-ASR is an open-source series of ASR models
Build Vision Agents quickly with any model or video provider
Multi-lingual large voice generation model, providing inference
The best free open source website change detection and restock service
An open source implementation of CLIP
A Multi-Modal World Model for Reconstructing, Generating, Simulation
A wiki system with complex functionality for simple integration
Multimodal AI chat app with dynamic conversation routing
Bidirectional token-classification model for identifiable info
Supercharge Your LLM with the Fastest KV Cache Layer
Quick illustration of how one can easily read books together with LLMs
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Large-language-model & vision-language-model based on Linear Attention
Turn words into colors
Fast multimodal LLM for real-time voice interaction and AI apps
Autoregressive Model Beats Diffusion