Inference Llama 2 in one file of pure C
Adding guardrails to large language models
Multilingual sentence & image embeddings with BERT
This repository provides an advanced RAG
An LLM-powered knowledge curation system that researches topics
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
Qwen3-Coder is the code version of Qwen3
Advanced techniques for RAG systems
Integrating LLMs into structured NLP pipelines
A high-performance ML model serving framework that offers dynamic batching
Framework that is dedicated to making neural data processing
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Train a 26M-parameter GPT from scratch in just 2h
Research-oriented chatbot framework
File Parser optimised for LLM Ingestion with no loss
Framework to easily create LLM-powered bots over any dataset
Set of tools to assess and improve LLM security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Technical principles related to large models
LLM abstractions that aren't obstructions
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Open source libraries and APIs to build custom preprocessing pipelines
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
BISHENG is an open LLM DevOps platform for next-generation apps
Central interface to connect your LLMs with external data