Wan2.2: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is the multimodal large language model series
Code for the paper Language Models are Unsupervised Multitask Learners
From Images to High-Fidelity 3D Assets
Run Llama 2 inference in one file of pure C
A Python toolbox for scalable outlier detection
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
3D reconstruction software
Open-source personal AI assistant for Linux, Windows and Mac
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Repo of Qwen2-Audio chat & pretrained large audio language model
State-of-the-art TTS model under 25MB
A set of Docker images for training and serving models in TensorFlow
Implementation of Make-A-Video, a new SOTA text-to-video generator
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
Fast inference engine for Transformer models
Streamline your ML workflow
Interpretable prompting and models for NLP
Python framework for AI workflows and pipelines with chain of thought