Gracefully face hCaptcha challenge with multimodal LLMs
From Paper to Presentation in One Click
State-of-the-art (SoTA) text-to-video pre-trained model
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Koog is the official Kotlin framework for building AI agents
Open-source SQL AI Agent for Text-to-SQL. Make Text2SQL Easy
Open-source platform for evaluating, observing, and improving LLM
Multi-source content processor for NotebookLM
Chat with multiple PDFs locally
Claude Autoresearch Skill, autonomous goal-directed iteration
Autonomous experiment loop extension for pi
Fast State-of-the-Art Static Embeddings
MCP server for interfacing with Godot game engine
Reflexion: Language Agents with Verbal Reinforcement Learning
Semantic search and document parsing tools for the command line
Package and deploy machine learning models using Docker containers
Unsupervised Learning for Image Registration
Data Science Roadmap from A to Z
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Build multimodal AI applications with cloud-native stack
Standards for building agents, better
Visual intelligence for your home
AI-powered tool for efficient abstract and PDF screening
AI-driven multi-agent research assistant automating hypothesis
From nobody to large language model (LLM) hero