Python bindings for llama.cpp
Official repository for LTX-Video
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Inference framework for 1-bit LLMs
LTX-Video Support for ComfyUI
Contexts Optical Compression
The official repo of Qwen chat & pretrained large language model
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Open-source multi-speaker long-form text-to-speech model
Video understanding codebase from FAIR for reproducing video models
Sharp Monocular Metric Depth in Less Than a Second
Official implementation of DreamCraft3D
Large Multimodal Models for Video Understanding and Editing
Diffusion Transformer with Fine-Grained Chinese Understanding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large-language-model & vision-language-model based on Linear Attention
Dataset of GPT-2 outputs for research in detection, biases, and more
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A Conversational Speech Generation Model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Detect faces in an image
Encoder of greater-than-word length text trained on a variety of data
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
CTC-based forced aligner for audio-text in 158 languages