Qwen2.5-VL is the multimodal large language model series
Gen-AI Chat for Teams
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Neural Network Compression Framework for enhanced OpenVINO
Replace OpenAI GPT with another LLM in your app
A unified interface for distributed computing
Investment Research for Everyone, Everywhere
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
The unofficial python package that returns response of Google Bard
A Repo For Document AI
Build a machine learning model from a prompt
Inference Llama 2 in one file of pure C
Python chatbot framework with Natural Language Understanding
A Python package for extending the official PyTorch
Self-Modifying Framework from the Future
Seamlessly integrate LLMs as Python functions
Refer and Ground Anything Anywhere at Any Granularity
Conversational voice AI agents
Efficient few-shot learning with Sentence Transformers
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments
Framework to easily create LLM powered bots over any dataset
Tool for exploring and debugging transformer model behaviors
Finding the Scaling Law of Agents. A multi-agent framework
Elyra extends JupyterLab with an AI centric approach