Replace OpenAI GPT with another LLM in your app
Training data (data labeling, annotation, workflow) for all data types
A chatbot built based on a large model
Video understanding codebase from FAIR for reproducing video models
The no-nonsense RAG chunking library
Persian NLP Toolkit
Repo of Qwen2-Audio chat & pretrained large audio language model
Ready-to-use OCR with 80+ supported languages
Conversational voice AI agents
An open and fair framework for everyone to build AI agents
Semantic search and workflows for medical/scientific papers
An open source object detection toolbox based on PyTorch
StreamSpeech is a seamless model for offline speech recognition
A framework to enable multimodal models to operate a computer
Chat & pretrained large vision language model
Build voice-based LLM agents. Modular + open source
Data manipulation and transformation for audio signal processing
The behavior guidance framework for customer-facing LLM agents
Models for the spaCy Natural Language Processing (NLP) library
Capable of understanding text, audio, vision, video
Real-time voice interactive digital human
Industrial-strength Natural Language Processing (NLP)
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Pre-trained Deep Learning models and demos
Stanford NLP Python library for many human languages