Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
VITS2 backbone with multilingual-bert
Open source codebase for Scale Agentex
An alignment auditing agent capable of exploring alignment hypotheses
The repository provides code for running inference with SAM 2
Audiocraft is a library for audio processing and generation
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
The ChatGPT Retrieval Plugin lets you easily find personal documents
Official Python implementation of UTCP. UTCP is an open standard
A modular high-level library to train embodied AI agents
A library for scientific machine learning & physics-informed learning
Training data (data labeling, annotation, workflow) for all data types
A refreshing functional take on deep learning
A minimal yet professional single agent demo project
Converts text to speech in real time
Diversity-driven optimization and large-model reasoning ability
Deploy and share agents with open infrastructure
Chat & pretrained large vision language model
Refactoring ChatBot+LLM, GPT-3.5-turbo, ChatGPT Bot/Voice Assistant
Open source framework for deep learning satellite and aerial imagery
Toolkit for conversational AI
A unified framework for scalable computing
Stanford NLP Python library for many human languages
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
LLM-based agent for general purpose software engineering tasks