Text and image to video generation: CogVideoX (2024) and CogVideo
Generate audiobooks from e-books, with voice cloning and 1107+ languages
No fortress, purely open ground. OpenManus is coming
Python inference and LoRA trainer package for the LTX-2 audio–video
EPUB to audiobook converter, optimized for Audiobookshelf
A community-supported supercharged version of paperless
Open source annotation tool for machine learning practitioners
Ready-to-use OCR with 80+ supported languages
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A Python tool that uses GPT-4, FFmpeg, and OpenCV
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Powerful tool that lets you create and run intelligent agents
An experimental version of the DeepSeek model
Qwen2.5-VL is the multimodal large language model series
ChemCrow
Reference PyTorch implementation and models for DINOv3
HY-Motion model for 3D character animation generation
Full stack AI software engineer
Open-source autonomous AI software engineer
⚡ Building applications with LLMs through composability ⚡
A sound cloning tool with a web interface, using your voice
Minimal CLI coding agent by Mistral
A command-line productivity tool powered by AI large language models
Let's make video diffusion practical
Qwen-Image is a powerful image generation foundation model