A Gradio web UI for running Large Language Models like LLaMA
The scientific Python development environment
OCRmyPDF adds an OCR text layer to scanned PDF files
InvokeAI is a leading creative engine for Stable Diffusion models
Open-source Python 3 tool for recognizing layouts, tables, and math
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Open-source personal AI assistant for Linux, Windows and Mac
Label Studio is a multi-type data labeling and annotation tool
Transforming Multimodal Content into Captivating Multilingual Audio
Crowdsourcing platform for full text transcription and tagging
Python bindings for MuPDF's rendering library
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Implementation of Imagen, Google's Text-to-Image Neural Network
Math OCR model that outputs LaTeX and markdown
Stable Diffusion built into Blender
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Network analysis in Python
Implementation of Phenaki Video, which uses MaskGIT
Parse files for optimal RAG
An open-source implementation of CLIP
The Markdown Editor for Linux
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting
State-of-the-art diffusion models for image and audio generation
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Implementation of Make-A-Video, a new SOTA text-to-video generator