Open-Sora: Democratizing Efficient Video Production for All
Framework for building, orchestrating, and deploying AI agents
Concatenate a directory full of files into a single prompt
Long-form streaming TTS system for multi-speaker dialogue generation
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Bailing is a voice dialogue robot similar to GPT-4o
Making RAG Simpler with Small and Open-Sourced Language Models
Extension of Google Research’s PaperBanana
Retrieval and Retrieval-augmented LLMs
Python tool for crawling and extracting structured data from news sites
Large-language-model & vision-language-model based on Linear Attention
Implementation of Make-A-Video, a new SOTA text-to-video generator
Phi-3.5 for Mac: Locally-run Vision and Language Models
Autoregressive Model Beats Diffusion
StarVector is a foundation model for SVG generation
General-purpose image editing model that delivers high-fidelity
Diffusion Transformer with Fine-Grained Chinese Understanding
Python crawler for collecting and downloading Sina Weibo user data
Python CLI utility and library for manipulating SQLite databases
A Pioneering Open-Source Alternative to GPT-4o
"Big Model" trains a visual multimodal VLM with 26M parameters
Open source AI VTuber platform with voice chat and Live2D avatars
AI-Powered Personalized Learning Assistant
Visual Causal Flow
A system for agentic LLM-powered data processing and ETL