A simple but complete full-attention transformer
Generate Any 3D Scene in Seconds
Pretrained model hub for Keras 3
DeepVariant is an analysis pipeline that uses a deep neural network
A Telegram RSS bot that cares about your reading experience
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
A set of Docker images for training and serving models in TensorFlow
PyTorch code and models for V-JEPA self-supervised learning from video
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The repository provides code for running inference with SAM 2
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
Open source framework for deep learning satellite and aerial imagery
A fast, powerful, and simple hierarchical vision transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Deep and Machine Learning for Microscopy
Integrate ChatGPT into your own Discord bot
Python SDK for the Computer Use model Lux, developed by OpenAGI
Document Image Parsing via Heterogeneous Anchor Prompting
Framework for building neural networks
Refer and Ground Anything Anywhere at Any Granularity
Machine Learning Pipelines for Kubeflow
An unofficial Python package that returns responses from Google Bard
Geometric deep learning extension library for PyTorch
Large language models and vision-language models based on Linear Attention