TT-NN operator library, and TT-Metalium low level kernel programming
Open-weight, large-scale hybrid-attention reasoning model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Bringing large-language models and chat to web browsers
Run AI models locally on your machine with node.js bindings for llama
Multilingual sentence & image embeddings with BERT
Toolkit for conversational AI
Ling is a MoE LLM provided and open-sourced by InclusionAI
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Open-source LLM load balancer and serving platform for hosting LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
New set of lightweight state-of-the-art, open foundation models
Diversity-driven optimization and large-model reasoning ability
An ecosystem of Rust libraries for working with large language models
Flagship MoE model for advanced reasoning, coding, and agents
Efficient MoE reasoning model for coding and math workloads