StarVector is a foundation model for SVG generation
The most powerful local music generation model
An Open Source implementation of Notebook LM with more flexibility
Flowly is 100x faster than OpenClaw
High-Resolution Image Synthesis with Latent Diffusion Models
A high-quality PDF-to-Markdown tool based on a large language model
Open source personal AI Assistant for Linux, Windows and Mac
Open source machine learning framework to automate text conversations
Enhances Tesseract OCR output using LLMs (local or API)
Code and models for the ICML 2024 paper, NExT-GPT
StreamSpeech is a seamless model for offline speech recognition
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Foundational model for human-like, expressive TTS
A Systematic Framework for Interactive World Modeling
Parse files for optimal RAG
An MCP server that autonomously evaluates web applications
Multilingual sentence & image embeddings with BERT
Automated translation solution for visual novels
General-purpose image editing model that delivers high-fidelity
Accurate × Fast × Comprehensive
Underthesea - Vietnamese NLP Toolkit
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open Source Document Management System for Digital Archives
Large-language-model & vision-language-model based on Linear Attention
Build Vision Agents quickly with any model or video provider