INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A modular graph-based Retrieval-Augmented Generation (RAG) system
Qwen3 is the large language model series developed by Qwen team
Interact with your documents using the power of GPT
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
BISHENG is an open LLM devops platform for next generation apps
An elegent pytorch implement of transformers
Access large language models from the command-line
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Powerful AI language model (MoE) optimized for efficiency/performance
LLM based data scientist, AI native data application
Tools like web browser, computer access and code runner for LLMs
LLM
A high-performance ML model serving framework, offers dynamic batching
File Parser optimised for LLM Ingestion with no loss
Open-source observability for your LLM application
PandasAI is a Python library that integrates generative AI
User toolkit for analyzing and interfacing with Large Language Models
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
Framework and no-code GUI for fine-tuning LLMs
Open-source, high-performance AI model with advanced reasoning
Train a 26M-parameter GPT from scratch in just 2h
AI agent that streamlines the entire process of data analysis
Technical principles related to large models