Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
95% token savings. 155x faster queries. 16 languages
A best practices guide for day 2 operations
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
The ChatGPT Retrieval Plugin lets you easily find personal documents
ChatGPT extension for scientific research work
A wiki system with complex functionality for simple integration
Next generation AWS IoT Client SDK for Python
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A minimal yet professional single agent demo project
Python implementation of TextRank algorithms
Set styles to words and create a Table of Contents in a click
Thin wrapper for "pandoc" (MIT)
Open-source tool to visualise your RAG
Mega Snap Merge v9.2.2 – Free desktop tool
Installable / Portable Python Distribution for Everyone.
Convert files like docx, xlsx, pptx, html, and more to MarkDown
Ship RAG based LLM web apps in seconds
FaceOnLive Open KYC: Streamlining Identity Verification with AI
High performance datastore for time series and tick data
Audio Transcription software for Linux (Vlc) with a foot pedal
PDF Utility is a tool designed to efficiently manipulate PDF files