SDG is a specialized framework
Synthetic data curation for post-training and data extraction
Multi-Agents LLM Financial Trading Framework
Open Source Deep Research Alternative to Reason and Search
A system for agentic LLM-powered data processing and ETL
Scalable data pre processing and curation toolkit for LLMs
Knowledge Graph Generation from Any Text
An Open-source Framework for Data-centric Language Agents
Bridging LLM and Recommender System
GLM-4 series: Open Multilingual Multimodal Chat LMs
Extension of Google Research’s PaperBanana
An Efficient Web-enhanced Question Answering System
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Empowering Code Generation with OSS-Instruct
Evaluate your LLM's response with Prometheus and GPT4
BISHENG is an open LLM devops platform for next generation apps
LLM training code for MosaicML foundation models
Did you say you like data?
Label, clean and enrich text datasets with LLMs
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)