Reference PyTorch implementation and models for DINOv3
⚡ Building applications with LLMs through composability ⚡
A sound cloning tool with a web interface, using your voice
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A natural language interface for computers
No fortress, purely open ground. OpenManus is Coming
A python tool that uses GPT-4, FFmpeg, and OpenCV
Lets make video diffusion practical
A community-supported supercharged version of paperless
Qwen2.5-VL is the multimodal large language model series
State-of-the-art TTS model under 25MB
Speech recognition module for Python
An experimental version of DeepSeek model
A Powerful Native Multimodal Model for Image Generation
Powerful tool that lets you create and run intelligent agents
Image polygonal annotation with Python
Qwen-Image is a powerful image generation foundation model
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Label Studio is a multi-type data labeling and annotation tool
Minimal CLI coding agent by Mistral
Industrial-level controllable zero-shot text-to-speech system
The official repo of Qwen chat & pretrained large language model
Easily turn large sets of image urls to an image dataset
Open-source autonomous AI software engineer
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)