GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
An alignment auditing agent capable of exploring alignment hypothesis
A modular high-level library to train embodied AI agents
A library for scientific machine learning & physics-informed learning
Python & JS/TS SDK for running AI-generated code/code
Agent Zero AI framework
Source code of PyGAD, Python 3 library for building genetic algorithms
Training Large Language Model to Reason in a Continuous Latent Space
Multimodal-Driven Architecture for Customized Video Generation
Low-code framework for building custom LLMs, neural networks
Ray Aviary - evaluate multiple LLMs easily
Chat & pretrained large vision language model
Lightweight Python library for adding real-time multi-object tracking
Toolkit for conversational AI
MCP integration platforms for AI agents to use tools at any scale
Time series forecasting with PyTorch
Deepfakes Software For All
Industrial-level controllable zero-shot text-to-speech system
LLM-based agent for general purpose software engineering tasks
A text-to-speech, speech-to-text and speech-to-speech library
The most reliable AI agent framework that supports MCP
Full stack AI software engineer
Bring the notion of Model-as-a-Service to life
Real-time behaviour synthesis with MuJoCo, using Predictive Control