Search all of YouTube from the command line
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Experimental, AI/ML-powered, open-source Marketing Mix Modeling
Multi-modal large language model designed for audio understanding
State-of-the-art (SoTA) text-to-video pre-trained model
OCR expert VLM powered by Hunyuan's native multimodal architecture
Towards Real-World Vision-Language Understanding
Context-aware AI Sales Agent to automate sales outreach
Enhances Tesseract OCR output using LLMs (local or API)
A collaboration-friendly studio for NeRFs
A comprehensive set of fairness metrics for datasets
Open-weight, large-scale hybrid-attention reasoning model
Capable of understanding text, audio, vision, and video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Communicate with an LLM provider using a single interface
A general fine-tuning kit geared toward image/video/audio diffusion
Enables the best performance on NVIDIA RTX Graphics Cards
Code for the paper "Evaluating Large Language Models Trained on Code"
LLM-powered fuzzing via OSS-Fuzz
Transform a cold separation into a warm Skill
Run a full local LLM stack with one command using Docker
Book 5: Statistics in Simplicity
Create beautiful slides on the web using Claude's frontend skills
AI tool for detecting complex vulnerabilities in Python codebases
An AI for Music Generation