MoBA: Mixture of Block Attention for Long-Context LLMs
How to optimize some algorithms in CUDA
Minimal CLI coding agent by Mistral
A generative speech model for daily dialogue
Paste Markdown and AI responses into Word and Excel instantly
Framework for building, orchestrating, and deploying AI agents
Containerized automation engine for programmable CI/CD workflows
AI assistant based on large models that can actively think and plan
Accurate × Fast × Comprehensive
A high-quality rapid TTS voice cloning model
Open-Source Financial Large Language Models
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
Reverse-engineered Python API for Google Gemini web app
The simplest, fastest repository for training/finetuning models
A middleware to provide an OpenAI-compatible endpoint
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AI bridge enabling assistants to control and automate Unity Editor
Framework for building AI-powered interactive digital humans and agents
An open phone agent model & framework
Contexts Optical Compression
Inference Llama 2 in one file of pure C
Provides convenient access to the Anthropic REST API from any Python 3
A high-quality tool for converting PDF to Markdown and JSON
Open-source browser using agents
A sound cloning tool with a web interface, using your voice