Concatenate a directory full of files into a single prompt
Use Microsoft Edge's online text-to-speech service from Python
Python library and CLI tool to interface with Google Translate
A nearly-live implementation of OpenAI's Whisper
Transformers4Rec is a flexible and efficient library
Models for object and human mesh reconstruction
Get a ChatGPT plugin up and running in under 5 minutes
TTS with Kokoro and ONNX Runtime
An AI-powered security review GitHub Action using Claude
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The official PyTorch implementation of Google's Gemma models
FlashMLA: Efficient Multi-head Latent Attention Kernels
Documentation for Google's Gen AI site - including Gemini API & Gemma
Towards Real-World Vision-Language Understanding
Towards Human-Level Text-to-Speech through Style Diffusion
Framework for building neural networks
A fast TTS architecture with conditional flow matching
The ChatGPT Retrieval Plugin lets you easily find personal documents
A dev-first open source autonomous AI agent framework
Fast C++ library for linear algebra & scientific computing
Plug-n-play module turning text-to-image models into animation
Chinese text-to-speech engine
800,000 step-level correctness labels on LLM solutions to MATH problem
Task of transcribing piano recordings into MIDI files
Classical piano MIDI dataset