Multilingual sentence & image embeddings with BERT
Efficient few-shot learning with Sentence Transformers
Integrating LLMs into structured NLP pipelines
Implementation of AudioLM audio generation model in Pytorch
Implementation of "MobileCLIP" CVPR 2024
Supercharge Your LLM with the Fastest KV Cache Layer
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Towards Real-World Vision-Language Understanding
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
An open source implementation of CLIP
A Model Context Protocol server for searching and analyzing arXiv
Refer and Ground Anything Anywhere at Any Granularity
TorchMultimodal is a PyTorch library
Deterministic LLMs Outputs for AI Applications and AI Agents
LLM training code for MosaicML foundation models
Integrate ChatGPT into your own discord bot
Open source no-code system for text annotation and building of text
Implementation of Imagen, Google's Text-to-Image Neural Network
Tools to ease the creation of snippets, syntax definitions, etc.
Simple, Pythonic building blocks to evaluate LLM applications
Stanford NLP Python library for many human languages
Chat & pretrained large audio language model proposed by Alibaba Cloud
Implementation of Phenaki Video, which uses Mask GIT
Implementation of Video Diffusion Models