Tokenizer-Free TTS for Multilingual Speech Generation
Context-aware desktop AI assistant that understands screen content
Python inference and LoRA trainer package for the LTX-2 audio–video
Claude Autoresearch Skill, autonomous goal-directed iteration
A Web UI for easy subtitle using whisper model
Automated Penetration Testing Agentic Framework Powered by LLMs
Zero-code platform for building AI agents from natural language input
Multilingual speech recognition and audio understanding model
AI tool for automating desktop tasks via natural language input
A nearly-live implementation of OpenAI's Whisper
RGBD video generation model conditioned on camera input
Library for building type-safe natural language interfaces with LLMs
AI tool for detecting complex vulnerabilities in Python codebases
Set of tools to assess and improve LLM security
Qwen3-TTS is an open-source series of TTS models
The official Node.js / Typescript library for the Groq API
Framework for validating and controlling LLM outputs in AI apps
This repository contains the official implementation of FastVLM
Code for openai.fm, a demo for the OpenAI Speech API
The open source AI research agent
Streaming markdown renderer for AI apps with smooth updates
Ultimate meta-skill for generating best-in-class Claude Code skills
Official inference repo for FLUX.2 models
Customizable AI chat component for websites with API support
Open speech-to-speech models and pipelines by Hugging Face toolkit AI