LLM
A community-supported supercharged version of paperless
Evaluate and monitor ML models from validation to production
Open source machine learning framework to automate text conversations
Capable of understanding text, audio, vision, video
MTEB: Massive Text Embedding Benchmark
Stanford NLP Python library for many human languages
Implementation of Imagen, Google's Text-to-Image Neural Network
Chat & pretrained large vision language model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A deep learning toolkit for Text-to-Speech, battle-tested in research
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
Easy-to-use and powerful NLP library with an awesome model zoo
Obsei is a low code AI powered automation tool
Open source personal AI Assistant for Linux, Windows and Mac
Qwen3-omni is a natively end-to-end, omni-modal LLM
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
Implementation of Make-A-Video, a new SOTA text-to-video generator
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
SOTA open-source TTS
Open-Sora: Democratizing Efficient Video Production for All
Lightweight package to simplify LLM API calls