The Classical Language Toolkit
Python bindings for llama.cpp
NLP Cloud serves high performance pre-trained or custom models for NER
Structured outputs for LLMs
High-level, high-performance dynamic language for technical computing
Robust Speech Recognition via Large-Scale Weak Supervision
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A modular, primitive-first, Python-first PyTorch library
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A Gradio web UI for running Large Language Models like LLaMA
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Ready-to-use OCR with 80+ supported languages
Vector Database for the next generation of AI applications
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Telegram Drive
A high-throughput and memory-efficient inference and serving engine
Sparsity-aware deep learning inference runtime for CPUs
Phi-3.5 for Mac: Locally-run Vision and Language Models
Speech-to-text, text-to-speech, and speaker recognition
Powerful tool that lets you create and run intelligent agents
Low-code app builder for RAG and multi-agent AI applications
Full stack AI software engineer