Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
An open-source implementation of CLIP
Toolkit for audio, music, and speech generation
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Extract schema, statistics and entities from datasets
Data loaders and abstractions for text and NLP
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Concatenate a directory full of files into a single prompt
One-click deployment (including offline integration package)
Model Context Protocol Server for Apache OpenDAL™
Easy-to-use and high-performance NLP and LLM framework
Fast and customizable framework for automatic ML model creation
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Qwen3-Coder is the code version of Qwen3
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
Code for the paper "Language Models are Unsupervised Multitask Learners"
Open-source framework for conversational voice AI agents
Open-source libraries and APIs to build custom preprocessing pipelines
OCR expert VLM powered by Hunyuan's native multimodal architecture
Models for the spaCy Natural Language Processing (NLP) library
Framework for building neural networks
Low-latency REST API for serving text-embeddings
Open-source choice to scale, assess and maintain natural language data
Stable Diffusion built into Blender