Doom-based AI research platform for reinforcement learning
Benchmarking Multimodal Agents for Open-Ended Tasks
From nobody to big model (LLM) hero
CV, NLP, LLM project applications, and advanced engineering deployment
Sharing knowledge about big models that everyone can understand
Recipes to train reward model for RLHF
A collection of machine learning examples and tutorials
A curated list of data mining papers about fraud detection
Open Source Immersive Translate
A game theoretic approach to explain the output of ml models
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Proofs, cases, concept supplements, and reference explanations
Constrained Value Alignment via Safe Reinforcement Learning
Fast and memory-efficient exact attention
Kodezi Chronos is a debugging-first language model
Llama Chinese community, real-time aggregation
Witness the aha moment of VLM with less than $3
Efficient few-shot learning with Sentence Transformers
AIGC algorithm engineer interview secrets
A selection of learning materials, search, recommendation, advertising
A modular, primitive-first, python-first PyTorch library
The fast.ai course notebooks
dLLM: Simple Diffusion Language Modeling
Autonomous Agents (LLMs) research papers. Updated Daily
Language Model Reinforcement Learning Environments frameworks