Check code for common misspellings
Low-latency REST API for serving text-embeddings
The open-source data curation platform for LLMs
The data structure for multimodal data
Collect your thoughts and notes without leaving the command line
Build AI-powered semantic search applications
Open source terminal session recorder
Framework that is dedicated to making neural data processing
MII makes low-latency and high-throughput inference possible
Visualize and compare datasets, target values and associations
Transformers4Rec is a flexible and efficient library
Code review for data in dbt
The standard data-centric AI package for data quality and ML
Synthetic data generators for structured and unstructured text
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Proofs, cases, concept supplements, and reference explanations
Build cross-modal and multimodal applications on the cloud
Open source annotation tool for machine learning practitioners
Task-oriented finetuning for better embeddings on neural search
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Implementation of "Tree of Thoughts
Dealing with all unstructured data, such as reverse image search
State-of-the-art Multilingual Question Answering research