Python tool for converting files and office documents to Markdown
Data and tools for generating and inspecting OLMo pre-training data
ReFT: Representation Finetuning for Language Models
Training data (data labeling, annotation, workflow) for all data types
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Pretrained model hub for Keras 3
A Python library powered by Language Models (LLMs)
The fastest way to bring multi-agent workflows to production
DoWhy is a Python library for causal inference
Deep universal probabilistic programming with Python and PyTorch
Seamlessly integrate LLMs as Python functions
Models for the spaCy Natural Language Processing (NLP) library
ExtractThinker is a Document Intelligence library for LLMs
Obsei is a low code AI powered automation tool
A full spaCy pipeline and models for scientific/biomedical documents
Database system for building simpler and faster AI-powered application
An AI personal assistant for your digital brain
PandasAI is a Python library that integrates generative AI
Open-source observability for your LLM application
Open source libraries and APIs to build custom preprocessing pipelines
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Browse the web, directly from Cursor etc.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
State-of-the-art Parameter-Efficient Fine-Tuning
A Repo For Document AI