Open source personal AI Assistant for Linux, Windows and Mac
Ready-to-run cloud templates for RAG
Implementation of "MobileCLIP" CVPR 2024
The official Python SDK for the ElevenLabs API
A Model Context Protocol server for searching and analyzing arXiv
Speech recognition module for Python
21 Lessons, Get Started Building with Generative AI
Build cross-modal and multimodal applications on the cloud
Generate blog articles from video or audio
Controllable and fast Text-to-Speech for over 7000 languages
Create prompt-friendly codebase digests from any Git repository URL
OpenRecall is a fully open-source, privacy-first alternative
Open source healthcare AI
Practical productivity tools for Claude Code, Codex-CLI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
This repository provides an advanced RAG
AutoGluon: AutoML for Image, Text, and Tabular Data
Automatically translates the text of a video based on a subtitle file
Parse files for optimal RAG
Concatenate a directory full of files into a single prompt
Making RAG Simpler with Small and Open-Sourced Language Models
An open source implementation of CLIP
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Document content and metadata extraction microservice
Context-aware desktop AI assistant that understands screen content