Easy-to-use Speech Toolkit including Self-Supervised Learning model
Open source NLP guide with models, methods, and real use cases
Repo of Qwen2-Audio chat & pretrained large audio language model
Traditional Mandarin LLMs for Taiwan
Refer and Ground Anything Anywhere at Any Granularity
NLP Cloud serves high performance pre-trained or custom models for NER
Fast multimodal LLM for real-time voice interaction and AI apps
A very simple framework for state-of-the-art NLP
Document content and metadata extraction microservice
Extract schema, statistics and entities from datasets
Running large language models on a single GPU
Cloud-native open source data warehouse for analytics and AI queries
A system for agentic LLM-powered data processing and ETL
Biomni: a general-purpose biomedical AI agent
The ultimate RAG for your monorepo
Create prompt-friendly codebase digests from any Git repository URL
Models for the spaCy Natural Language Processing (NLP) library
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Chat & pretrained large audio language model proposed by Alibaba Cloud
Towards Human-Level Text-to-Speech through Style Diffusion
LongBench v2 and LongBench (ACL 25'&24')
Solve end to end problems using Llama model family
Multi-tool for semantic search
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning