ExtractThinker is a Document Intelligence library for LLMs
Unified embedding model
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models
AI video agents framework for next-gen video interactions
A tool for learning vector representations of words and entities
Trained models & code to predict toxic comments
Data and tools for generating and inspecting OLMo pre-training data
Obsei is a low code AI powered automation tool
Persian NLP Toolkit
Fast and customizable framework for automatic ML model creation
WikiChat is an improved RAG
Efficient Retrieval Augmentation and Generation Framework
A Heterogeneous Benchmark for Information Retrieval
Libraries for applying sparsification recipes to neural networks
The no-nonsense RAG chunking library
Sparsity-aware deep learning inference runtime for CPUs
Allows you to maintain all the necessary cruft for building projects
LLM based data scientist, AI native data application
FastAPI server-side rendering with built-in HTMX support.
Node repackaging(wrapping) of the LLVM Clang's clang-format
Tools like web browser, computer access and code runner for LLMs
Mist is an open source, multicloud management platform
pg_activity is a top like application for PostgreSQL server activity
ReFT: Representation Finetuning for Language Models
ROS packages for Turtlebot3