Enhances Tesseract OCR output using LLMs (local or API)
Code and models for ICML 2024 paper, NExT-GPT
Extract audio and video content and organize it into a Markdown note
StreamSpeech is a seamless model for offline speech recognition
Personal mini-web in text
A sound cloning tool with a web interface, using your voice
HTML Loader
StarVector is a foundation model for SVG generation
Foundational model for human-like, expressive TTS
Python tool for crawling and extracting structured data from news sites
Flowly is 100x faster than OpenClaw
Evaluate and monitor ML models from validation to production
Industrial-strength Natural Language Processing (NLP)
A lightweight approach to removing Google web service dependency
Capable of understanding text, audio, vision, video
Python framework for adversarial attacks and data augmentation
Sample code and notebooks for Generative AI on Google Cloud
Network analysis in Python
Extract one-time password (OTP) secrets from QR codes
Create prompt-friendly codebase digests from any Git repository URL
A Python library for extracting structured information
GenAI Processors is a lightweight Python library
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
LLM abstractions that aren't obstructions
Full-text, IPFS-friendly, WASM-compatible search in Rust