Towards Real-World Vision-Language Understanding
AI agent microservice
Convert codebases into structured prompts optimized for LLM analysis
AI multi-agent framework for automating data-driven R&D workflows
Your Personal Research Multi-Tool
CV, NLP, LLM project applications, and advanced engineering deployment
One API call, pull Claude agent, completely sandboxed
Implementation of Vision Transformer, a simple way to achieve SOTA
Refer and Ground Anything Anywhere at Any Granularity
Foundation Models for Time Series
Self-supervised visual learning using momentum contrast in PyTorch
Easy-to-use,Modular and Extendible package of deep-learning models
Asynchronous multi-platform robot framework written in Python
Large-language-model & vision-language-model based on Linear Attention
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Unified Multimodal Understanding and Generation Models
Python framework for AI workflows and pipelines with chain of thought
Easy-to-use and powerful NLP library with Awesome model zoo
A cross-platform Python library for differentiable programming
Best practices on recommendation systems
Bailing is a voice dialogue robot similar to GPT-4o
Plug-and-play library to enable agents to call MCP and UTCP tools
Open source framework for deep learning satellite and aerial imagery
A python library for self-supervised learning on images
Open Source Differentiable Computer Vision Library