Python tool for converting files and office documents to Markdown
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Web interface for generating images using Stable Diffusion models
Code for running inference and finetuning with SAM 3 model
Label Studio is a multi-type data labeling and annotation tool
Audiocraft is a library for audio processing and generation
Persian NLP Toolkit
Toolkit for conversational AI
Generating Immersive, Explorable, and Interactive 3D Worlds
Open source no-code system for text annotation and building of text
Qwen-Image is a powerful image generation foundation model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Underthesea - Vietnamese NLP Toolkit
An open-source toolkit for monitoring Large Language Models (LLMs)
CLIP: predict the most relevant text snippet given an image
The most accurate natural language detection library for Python
Library for OCR-related tasks powered by Deep Learning
Implementation of Video Diffusion Models
Implementation of Phenaki Video, which uses MaskGIT
Machine learning, conversational dialog engine for creating chat bots
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Data loaders and abstractions for text and NLP
21 Lessons, Get Started Building with Generative AI
A full spaCy pipeline and models for scientific/biomedical documents