Gracefully face hCaptcha challenge with multimodal llms
Numerical differential equation solvers in JAX
Data science interview questions and answers
Controllable & emotion-expressive zero-shot TTS
End-to-end speech processing toolkit
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Fast image augmentation library and an easy-to-use wrapper
Decomposable Multiscale Mixing for Time Series Forecasting
Quickly get started with AI theory and practical applications
Implementation for MatMul-free LM
A speech-text foundation model for real time dialogue
tensorboard for pytorch (and chainer, mxnet, numpy, etc.)
Open Source Differentiable Computer Vision Library
A fast TTS architecture with conditional flow matching
SOTA discrete acoustic codec models with 40/75 tokens per second
A TTS model capable of generating ultra-realistic dialogue
A simple forecasting package
Build high-performance AI models with modular building blocks
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
The Model Zoo of cognitive diagnosis models
Software that uses AI to perform real-time voice conversion
On-device Speech-to-Intent engine powered by deep learning
Fault-tolerant, highly scalable GPU orchestration
Fast and Easy Infinite Neural Networks in Python
A Python vector database you just need, no more, no less