Provides code for running inference with the SegmentAnything Model
A simple, open format for guiding coding agents
A Family of Open Foundation Models for Code Intelligence
Easily turn large sets of image urls to an image dataset
Use Microsoft Edge's online text-to-speech service from Python
One-click deployment (including offline integration package)
A single Gradio + React WebUI with extensions for ACE-Step
The python library for real-time communication
Towards Human-Level Text-to-Speech through Style Diffusion
A TTS model capable of generating ultra-realistic dialogue
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Renderer for the harmony response format to be used with gpt-oss
Large-language-model & vision-language-model based on Linear Attention
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Implementation of Vision Transformer, a simple way to achieve SOTA
This repository contains the official implementation of FastVLM
Inference code for CodeLlama models
PyTorch code and models for the DINOv2 self-supervised learning
Claude Code action for GitHub PRs
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official implementation of DreamCraft3D
Transformers4Rec is a flexible and efficient library
Visual Studio Code client for Tabnine
MARS5 speech model (TTS) from CAMB.AI