Data Infrastructure providing an approach to multimodal AI workloads
Enhances Tesseract OCR output using LLMs (local or API)
Open-Source Financial Large Language Models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
21 Lessons, Get Started Building with Generative AI
AI-powered tool for developers, simplifying coding tasks
A collaboration friendly studio for NeRFs
The Multi-Agent Framework
Industrial-strength Natural Language Processing (NLP)
ExtractThinker is a Document Intelligence library for LLMs
A Repo For Document AI
Operating LLMs in production
Automatically translates the text of a video based on a subtitle file
Foundation Models for Time Series
Run a full local LLM stack with one command using Docker
Open source AI model for generating full songs from lyrics prompts
AI tool for real-time monitoring and analysis of Goofish listings
An AI-powered file management tool that ensures privacy
Controllable & emotion-expressive zero-shot TTS
The official Python SDK for the ElevenLabs API
Code for running inference with the SAM 3D Body Model 3DB
An AI-powered security review GitHub Action using Claude
Renderer for the harmony response format to be used with gpt-oss
kaldi-asr/kaldi is the official location of the Kaldi project