High-Resolution Image Synthesis with Latent Diffusion Models
Phi-3.5 for Mac: Locally-run Vision and Language Models
A PyTorch library for implementing flow matching algorithms
Renderer for the harmony response format to be used with gpt-oss
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
FAIR Sequence Modeling Toolkit 2
DeepSeek Coder: Let the Code Write Itself
Official DeiT repository
Example Discord bot written in Python that uses the completions API
The ChatGPT Retrieval Plugin lets you easily find personal
A Unified Framework for Text-to-3D and Image-to-3D Generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Open-source large language model family from Tencent Hunyuan
Multimodal Diffusion with Representation Alignment
Repo of Qwen2-Audio chat & pretrained large audio language model
High-resolution models for human tasks
Capable of understanding text, audio, vision, video
Qwen2.5-VL is the multimodal large language model series
VMZ: Model Zoo for Video Modeling
Official implementation of Watermark Anything with Localized Messages
Tooling for the Common Objects In 3D dataset
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
An AI-powered security review GitHub Action using Claude