A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Image polygonal annotation with Python
On-device Speech-to-Intent engine powered by deep learning
AI Toolkit for Healthcare Imaging
An MCP server for interacting with Google Colab
Open-source platform for building enterprise-grade agents
An open-source RAG-based tool for chatting with your documents
Superfast AI decision making and processing of multi-modal data
Collection of cybersecurity-related references, scripts, tools, code
Efficient Retrieval Augmentation and Generation Framework
An unsupervised and free tool for image and video dataset analysis
Chat with your SQL database
Trained models & code to predict toxic comments
ReFT: Representation Finetuning for Language Models
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
Package and deploy machine learning models using Docker containers
An alignment auditing agent capable of exploring alignment hypothesis
A minimal yet professional single agent demo project
Claude Code skill that researches any topic across Reddit + X
Build GenAI application quick and easy
AI-powered code generation tool for scratch development of web apps
Create Customized Software using Natural Language Idea
Real-time multi-AI collaboration: Claude, Codex & Gemini
Implement a concise and clear Deep Search Agent from 0
Minimal CLI coding agent by Mistral