SDG is a specialized framework
Synthetic data curation for post-training and data extraction
Open Source Deep Research Alternative to Reason and Search
Learn AI and LLMs from scratch using free resources
The platform for LLM evaluations and AI agent testing
A system for agentic LLM-powered data processing and ETL
Scalable data pre processing and curation toolkit for LLMs
Knowledge Graph Generation from Any Text
CLI proxy that reduces LLM token consumption
An Open-source Framework for Data-centric Language Agents
Bridging LLM and Recommender System
Extension of Google Research’s PaperBanana
An Efficient Web-enhanced Question Answering System
GLM-4 series: Open Multilingual Multimodal Chat LMs
Course to get into Large Language Models (LLMs)
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
A powerful tool for creating datasets for LLM fine-tuning
Evaluate your LLM's response with Prometheus and GPT4
Empowering Code Generation with OSS-Instruct
BISHENG is an open LLM devops platform for next generation apps
Extract and convert data from any document, images, pdfs, word doc
Curated list of datasets and tools for post-training
LLM training code for MosaicML foundation models
Did you say you like data?
The TypeScript framework for AI development