lightweight package to simplify LLM API calls
Document Image Parsing via Heterogeneous Anchor Prompting”
A gradio web UI for running Large Language Models like LLaMA
Build AI-powered applications with React, Svelte, Vue, and Solid
Easy-to-use Speech Toolkit including Self-Supervised Learning model
The python library for real-time communication
A react-based starter app for using the Live API over websockets
Access to Anthropic's safety-first language model APIs
A HTML5 video player with a parser that saves traffic
StreamSpeech is a seamless model for offline speech recognition
The official Python SDK for the ElevenLabs API
Provides convenient access to the Anthropic REST API from any Python 3
This SDK is now deprecated, use the new unified Google GenAI SDK
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Execute SQL queries and manage databases seamlessly with Timeplus
Towards Human-Sounding Speech
Virtual AI anchor that combines state-of-the-art technology
Build voice-based LLM agents. Modular + open source
OpenAI Assistants API quickstart with Next.js
Capable of understanding text, audio, vision, video
A middleware to provide an openAI compatible endpoint
Reverse-engineered Python API for Google Gemini web app
Browser extension and cross-platform desktop app based on ChatGPT API
Built for demanding AI workflows
NVR with realtime local object detection for IP cameras