Wan2.2: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is the multimodal large language model series
Code for the paper Language Models are Unsupervised Multitask Learners
From Images to High-Fidelity 3D Assets
Minimal C implementation for training and running inference on Llama 2
A Python toolbox for scalable outlier detection
An experimental version of the DeepSeek model
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
3D reconstruction software
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-source personal AI assistant for Linux, Windows and Mac
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Repo of Qwen2-Audio chat & pretrained large audio language model
State-of-the-art TTS model under 25MB
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
A set of Docker images for training and serving models in TensorFlow
Implementation of Make-A-Video, a new SOTA text-to-video generator
Streamline your ML workflow
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Interpretable prompting and models for NLP
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon