Testing tool for modeling GUI transitions
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Open source AI model for generating full songs from lyrics prompts
Open Source OCR Engine
FAIR Sequence Modeling Toolkit 2
kaldi-asr/kaldi is the official location of the Kaldi project
Low-latency AI inference engine optimized for mobile devices
VMZ: Model Zoo for Video Modeling
Code for Cicero, an AI agent that plays the game of Diplomacy
Build your own AI friend
Speech-to-text, text-to-speech, and speaker recognition
Powerful Android AI agent with tools, automation, and Linux shell
Port of OpenAI's Whisper model in C/C++
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Structure-from-Motion and Multi-View Stereo
CV-CUDA™ is an open-source, GPU-accelerated library
Offline speech recognition API for Android, iOS, Raspberry Pi
Audio plugin for audio-to-MIDI transcription using deep learning
Open Source Computer Vision Library
ONNX Runtime: cross-platform, high performance ML inferencing
Awesome multilingual OCR toolkits based on PaddlePaddle
Distribute and run LLMs with a single file
A retargetable MLIR-based machine learning compiler runtime toolkit