State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Library for OCR-related tasks powered by Deep Learning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
MTEB: Massive Text Embedding Benchmark
A fast and lightweight IDE
Statusline plugin for vim with prompts for several other applications
Mozc - a Japanese Input Method Editor designed for multi-platform
Crowdsourcing platform for full text transcription and tagging
Audiocraft is a library for audio processing and generation
The most accurate natural language detection library for Python
Persian NLP Toolkit
Generating Immersive, Explorable, and Interactive 3D Worlds
TTS with kokoro and onnx runtime
IEEE VHDL-93 LRM supported parser implemented in Java, APIs Python/Tcl
Math OCR model that outputs LaTeX and markdown
A library to help you make the most out of your Pixoo 64
Compute distance between sequences
Multi-Voice and Prompt-Controlled TTS Engine
The behavior guidance framework for customer-facing LLM agents
Tools to ease the creation of snippets, syntax definitions, etc.
Stanford NLP Python library for many human languages
Implementation of Phenaki Video, which uses Mask GIT
Open source no-code system for text annotation and building of text
High accuracy RAG for answering questions from scientific documents