The Classical Language Toolkit
Python bindings for llama.cpp
NLP Cloud serves high performance pre-trained or custom models for NER
Structured outputs for llms
High-level, high-performance dynamic language for technical computing
Robust Speech Recognition via Large-Scale Weak Supervision
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A modular, primitive-first, python-first PyTorch library
A gradio web UI for running Large Language Models like LLaMA
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Powerful AI language model (MoE) optimized for efficiency/performance
Vector Database for the next generation of AI applications
Open-source, high-performance AI model with advanced reasoning
Telegram Drive
Phi-3.5 for Mac: Locally-run Vision and Language Models
A high-throughput and memory-efficient inference and serving engine
Sparsity-aware deep learning inference runtime for CPUs
Speech-to-text, text-to-speech, and speaker recognition
Powerful tool that lets you create and run intelligent agents
Low-code app builder for RAG and multi-agent AI applications
Full stack AI software engineer
Generate short videos with one click using AI LLM
Finding the Scaling Law of Agents. A multi-agent framework