LISA: Reasoning Segmentation via Large Language Model
AI assistant based on large models that can actively think and plan
Implementation of Vision Transformer, a simple way to achieve SOTA
Gemma open-weight LLM library, from Google DeepMind
Deep and Machine Learning for Microscopy
Pretrained model hub for Keras 3
The repository provides code for running inference with SAM 2
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
A Telegram RSS bot that cares about your reading experience
Datasets, transforms and models specific to Computer Vision
HunyuanVideo: A Systematic Framework For Large Video Generation Model
code for Mesh R-CNN, ICCV 2019
Bring the notion of Model-as-a-Service to life
An extensive node suite that enables ComfyUI to process 3D inputs
PyTorch code and models for the DINOv2 self-supervised learning
Simplest working implementation of Stylegan2
3D reconstruction software
Jittor is a high-performance deep learning framework
The standard data-centric AI package for data quality and ML
A python library for self-supervised learning on images
A simple, high-quality voice conversion tool focused on ease of use
Open-source evaluation toolkit of large multi-modality models (LMMs)
Multimodal embedding and reranking models built on Qwen3-VL
Integrate ChatGPT into your own discord bot