Official repo for Sa2VA: Marrying SAM2 with LLaVA
LLM-based agent for general purpose software engineering tasks
High-Fidelity and Controllable Generation of Textured 3D Assets
Multi-modal large language model designed for audio understanding
Open-source framework for intelligent speech interaction
OCR expert VLM powered by Hunyuan's native multimodal architecture
Scalable machine learning for time series forecasting
On-device Speech-to-Intent engine powered by deep learning
Benchmarking synthetic data generation methods
Making Enterprise Data Intelligent and Responsive for AI
AIMET is a library that provides advanced quantization and compression
Powering Amazon custom machine learning chips
An advanced paper search agent powered by large language models
LLM-based reinforcement learning audio editing model
GUI Exploration Lab. One of the best GUI agent solutions
Open-weight, large-scale hybrid-attention reasoning model
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Python 3 package for easily bypassing reCAPTCHA/reCAPTCHA Mobile/hCaptcha
Library for training machine learning models with privacy for data
Toolkit for audio, music, and speech generation
Images to inference with no labeling
Machine Learning Systems: Design and Implementation
Implementation of Recurrent Interface Network (RIN)
Open platform for training, serving, and evaluating language models
AI agent that streamlines the entire process of data analysis