Translate the video from one language to another and embed dubbing
ComfyUI wrapper nodes for HunyuanVideo
A Sublime Text 2/3 plugin to see git diff in gutter
A high-quality PDF to Markdown tool based on large language model
Enhances Tesseract OCR output using LLMs (local or API)
A community sourced database of game controller mappings
Simple, Pythonic building blocks to evaluate LLM applications
Unifying 3D Mesh Generation with Language Models
A Python tool to help extracting information from structured PDFs
Unlock the fullest potential of your device
Controllable & emotion-expressive zero-shot TTS
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A Python library for extracting structured information
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
LLM abstractions that aren't obstructions
Knowledge Graph Generation from Any Text
Generating Immersive, Explorable, and Interactive 3D Worlds
Qwen-Image is a powerful image generation foundation model
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototype and production
LaTeX source and supporting code for Think Python, 2nd edition
21 Lessons, Get Started Building with Generative AI
borb is a library for reading, creating and manipulating PDF files
Han Language Processing