An open sourced end-to-end VLM-based GUI Agent
AI tool for detecting complex vulnerabilities in Python codebases
Set of tools to assess and improve LLM security
Qwen3-TTS is an open-source series of TTS models
TFDS is a collection of datasets ready to use with TensorFlow,
Taming Stable Diffusion for Lip Sync
GPT4V-level open-source multi-modal model based on Llama3-8B
Large Audio Language Model built for natural interactions
Ultimate meta-skill for generating best-in-class Claude Code skills
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Open-source framework for intelligent speech interaction
Official inference repo for FLUX.2 models
Magnetoencephalography (MEG) and Electroencephalography EEG in Python
GUI Exploration Lab. One of the best GUI agent solutions
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
E2M converts various file types (doc, docx, epub, html, htm, url
The Cradle framework is a first attempt at General Computer Control
The Security Toolkit for LLM Interactions
Code and models for ICML 2024 paper, NExT-GPT
Inference script for Oasis 500M
Framework for building neural networks
This repository contains the official implementation of FastVLM
Aider is AI pair programming in your terminal
Practical productivity tools for Claude Code, Codex-CLI
RGBD video generation model conditioned on camera input