OCRmyPDF adds an OCR text layer to scanned PDF files
Framework for building AI-powered interactive digital humans and agent
NeuTTS model built from small LLM backbones
On-device TTS model by Neuphonic
Machine learning on FPGAs using HLS
Offline Text To Speech synthesis for python
SOTA Open Source TTS
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
A Powerful Native Multimodal Model for Image Generation
Create UIs for your machine learning model in Python in 3 minutes
A Domain-Fronting Relay that routes traffic though GAS
Parse files for optimal RAG
Open-source multi-speaker long-form text-to-speech model
AI agent microservice
Numerical differential equation solvers in JAX
Making RAG Simpler with Small and Open-Sourced Language Models
Automated translation solution for visual novels
A Model Context Protocol server for searching and analyzing arXiv
The collaborative spreadsheet for AI
Implementation of 'lightweight' GAN, proposed in ICLR 2021
Visual Automation IDE — automate anything you see on screen
Shinkai allows you to create advanced AI (local) agents effortlessly
computer vision projects | Fun AI projects related to computer vision
Open-source tool to visualise your RAG
Chat language model that can use tools and interpret the results