A Powerful Native Multimodal Model for Image Generation
PDF to Markdown with vision models
Instagram OSINT tool for gathering profile data and public posts
Open source personal AI Assistant for Linux, Windows and Mac
TextWorld is a sandbox learning environment for the training
Toolkit for conversational AI
Framework for building realtime multimodal voice AI agents apps
Open source machine learning framework to automate text conversations
The Python code to reproduce illustrations from Machine Learning Book
Math OCR model that outputs LaTeX and markdown
Knowledge Graph Generation from Any Text
Knowledge Agents and Management in the Cloud
A library to help you make the most out of your Pixoo 64
Easy to use Python library for creating 2D arcade games
LLM
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Deep Research framework, combining language models with tools
Multimodal-Driven Architecture for Customized Video Generation
Zero-copy PDF text extraction library written in Zig
Tools to ease the creation of snippets, syntax definitions, etc.
Han Language Processing
Generate blog articles from video or audio
MTEB: Massive Text Embedding Benchmark
Implementation of Imagen, Google's Text-to-Image Neural Network
Stanford NLP Python library for many human languages