AI tool converting video/audio into structured documents instantly
A simple native web interface that uses ChatTTS to synthesize text
A python tool that uses GPT-4, FFmpeg, and OpenCV
Implementation of "MobileCLIP" CVPR 2024
Build Vision Agents quickly with any model or video provider
Universal LLM Deployment Engine with ML Compilation
A nearly-live implementation of OpenAI's Whisper
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Synchronized Translation for Videos
Open Source Computer Vision Library
AI Suite for upscaling, interpolating & restoring images/videos
A Conversational Speech Generation Model
TensorFlow documentation
Reference implementation of the Transformer architecture optimized
OpenSourceTelegramRAT - Remote PC access via Telegram Bot.
DCVGAN: Depth Conditional Video Generation, ICIP 2019.
3D ResNets for Action Recognition (CVPR 2018)
Just Another Speech Recognition and Text to Speech software.