Python tool for converting files and office documents to Markdown
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Code for running inference and finetuning with SAM 3 model
Web interface for generating images using Stable Diffusion models
Audiocraft is a library for audio processing and generation
Persian NLP Toolkit
Label Studio is a multi-type data labeling and annotation tool
Toolkit for conversational AI
Generating Immersive, Explorable, and Interactive 3D Worlds
Open source no-code system for text annotation and building of text
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Qwen-Image is a powerful image generation foundation model
Underthesea - Vietnamese NLP Toolkit
An open-source toolkit for monitoring Language Learning Models (LLMs)
CLIP, Predict the most relevant text snippet given an image
The most accurate natural language detection library for Python
Implementation of Video Diffusion Models
Implementation of Phenaki Video, which uses Mask GIT
Machine learning, conversational dialog engine for creating chat bots
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Data loaders and abstractions for text and NLP
A full spaCy pipeline and models for scientific/biomedical documents
LLM
21 Lessons, Get Started Building with Generative AI