Python inference and LoRA trainer package for the LTX-2 audio–video
Code for openai.fm, a demo for the OpenAI Speech API
Automated Penetration Testing Agentic Framework Powered by LLMs
RGBD video generation model conditioned on camera input
A nearly-live implementation of OpenAI's Whisper
Large Audio Language Model built for natural interactions
An experimental version of DeepSeek model
This repository contains the official implementation of FastVLM
Official inference repo for FLUX.2 models
Ultimate meta-skill for generating best-in-class Claude Code skills
Inference script for Oasis 500M
Framework for building neural networks
Set of tools to assess and improve LLM security
The official Node.js / Typescript library for the Groq API
Anthony Fu's curated collection of agent skills
Recovering the Visual Space from Any Views
A python tool that uses GPT-4, FFmpeg, and OpenCV
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
From Paper to Presentation in One Click
Real-World Centric Foundation GUI Agents
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Central interface to connect your LLM's with external data
Project showcasing Llama 3.3 70B HTML codegen abilities
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A neural network that transforms a design mock-up into static websites