Build AI-powered semantic search applications
A multi-function Discord bot
A Python toolbox for scalable outlier detection
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Python SDK for the Computer Use model Lux, developed by OpenAGI
Experimental, AI/ML-powered and open sourced Marketing Mix Modeling
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general purpose software engineering tasks
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
A minimal yet professional single agent demo project
Real-time voice interactive digital human
Concatenate a directory full of files into a single prompt
OCR expert VLM powered by Hunyuan's native multimodal architecture
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Benchmarking synthetic data generation methods
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Powering Amazon custom machine learning chips
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An advanced paper search agent powered by large language models
LLM-based Reinforcement Learning audio edit model
GUI Exploration Lab. One of the best GUI agent solutions
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention