When LLM Meets Domain Experts
ChatGPT interface with better UI
Chat-based assistant that understands tasks
Machine Learning automation and tracking
High-Fidelity and Controllable Generation of Textured 3D Assets
Create UIs for your machine learning model in Python in 3 minutes
Guiding Instruction-based Image Editing via Multimodal Large Language
OCR expert VLM powered by Hunyuan's native multimodal architecture
SOTA Open Source TTS
A Universal Customization Method for Single and Multi Conditioning
LLM powered fuzzing via OSS-Fuzz
MiniSom is a minimalistic implementation of the Self Organizing Maps
21 Lessons, Get Started Building with Generative AI
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
A python library for self-supervised learning on images
Omnilingual ASR Open-Source Multilingual SpeechRecognition
PPTAgent: Generating and Evaluating Presentations
Implements weak-to-strong learning for training stronger ML models
OpenLIT is an open-source LLM Observability tool
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
AI discovers 520000 stable inorganic crystal structures for research
MCP integration platforms for AI agents to use tools at any scale
Implementation of Vision Transformer, a simple way to achieve SOTA
Set of tools to assess and improve LLM security
The repository provides code for running inference with SAM 2