GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Python SDK for the Computer Use model Lux, developed by OpenAGI
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
A minimal yet professional single agent demo project
Real-time voice interactive digital human
An Open Source text-to-speech system built by inverting Whisper
Official MiniMax Model Context Protocol (MCP) server
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Converts text to speech in realtime
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Benchmarking synthetic data generation methods
AIMET is a library that provides advanced quantization and compression
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Powering Amazon custom machine learning chips
An advanced paper search agent powered by large language models
GUI Exploration Lab. One of the best GUI agent solutions
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Free, high-quality text-to-speech API endpoint to replace OpenAI
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
A python library for easy manipulation and forecasting of time series
BitNet: Scaling 1-bit Transformers for Large Language Models
Capable of understanding text, audio, vision, video