Tooling for the Common Objects In 3D dataset
LTX-Video Support for ComfyUI
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
DeepSeek Coder: Let the Code Write Itself
Qwen3-Coder is the code version of Qwen3
A Powerful Native Multimodal Model for Image Generation
Sharp Monocular Metric Depth in Less Than a Second
Foundation Models for Time Series
This repository contains the official implementation of FastVLM
The official PyTorch implementation of Google's Gemma models
Official code for Style Aligned Image Generation via Shared Attention
Code for the paper Hybrid Spectrogram and Waveform Source Separation
This repository contains the official implementation of research
Fine-tuning ChatGLM-6B with PEFT
Reproduces results of "Fixing the train-test resolution discrepancy"
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for the paper "Improved Techniques for Training GANs"