Multilingual Document Layout Parsing in a Single Vision-Language Model
Search all of YouTube from the command line
Download pictures (or videos) along with their captions
The recursive internet scanner for hackers
All-in-one AI framework & toolkit for Claude Code & Cursor
Python SDK for Claude Agent
An anomaly detection library comprising state-of-the-art algorithms
Research-oriented chatbot framework
Towards Human-Sounding Speech
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
A state-of-the-art open visual language model
AI multi-agent platform for automated code security auditing system
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Fully Local Manus AI. No APIs, No $200 monthly bills
12 Weeks, 24 Lessons, AI for All
New family of code large language models (LLMs)
A Decky Loader plugin that brings together games
Democratizing Reinforcement Learning for LLMs
Image-to-Image Translation in PyTorch
Positron, a next-generation data science IDE
GPT4V-level open-source multi-modal model based on Llama3-8B
The only cheat sheet you need
A set of utilities for monitoring and customizing GPU performance
Free and open-source digital preservation system
Pytorch domain library for recommendation systems