Synchronized Translation for Videos
Image inpainting tool powered by SOTA AI Model
Translate the video from one language to another and embed dubbing
Simple, Pythonic building blocks to evaluate LLM applications
Unifying 3D Mesh Generation with Language Models
A high-quality PDF to Markdown tool based on large language model
A Python tool to help extracting information from structured PDFs
Enhances Tesseract OCR output using LLMs (local or API)
A community sourced database of game controller mappings
Controllable & emotion-expressive zero-shot TTS
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A Python library for extracting structured information
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Unlock the fullest potential of your device
Knowledge Graph Generation from Any Text
LLM abstractions that aren't obstructions
A library to help you make the most out of your Pixoo 64
LLM
Audiocraft is a library for audio processing and generation
Generating Immersive, Explorable, and Interactive 3D Worlds
borb is a library for reading, creating and manipulating PDF files
Create videos with Stable Diffusion
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototype and production