LLM training code for MosaicML foundation models
Data Lake for Deep Learning. Build, manage, and query datasets
Multilingual sentence & image embeddings with BERT
Request recommended movies, TV shows and anime to Jellyseer/Overseer
Diversity-driven optimization and large-model reasoning ability
This repository provides an advanced RAG
Inference code for CodeLlama models
Tensor search for humans
Concatenate a directory full of files into a single prompt
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Official Repo for ICML 2024 paper
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
Committed to building an open, public welfare
A Pioneering Open-Source Alternative to GPT-4o
Evals is a framework for evaluating LLMs and LLM systems
LLM powered fuzzing via OSS-Fuzz
PyTorch library of curated Transformer models and their components
Automatic question answering for local knowledge bases based on LLM
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Chat & pretrained large audio language model proposed by Alibaba Cloud
Chat & pretrained large vision language model
Retrieval Augmented Generation (RAG) framework