Showing 229 open source projects for "real time"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    HY-World 1.5

    HY-World 1.5

    A Systematic Framework for Interactive World Modeling

    ...It aims to empower AI agents with the capability to both understand and generate multimedia content — including text, audio, image, and potentially 3D or game-world elements — enabling lifelike dialogue, environmental interpretations, and responsive world behavior. The platform targets use cases in digital entertainment, game worlds, training simulators, and interactive storytelling, where AI agents need to adapt to real-time user inputs and changes in environment state. It blends advanced reasoning with multimodal synthesis, enabling agents to describe scenes, generate context-appropriate responses, and contribute to narrative or gameplay flows. The underlying framework typically supports large-context state tracking across extended interactions, blending temporal and spatial multimodal signals.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    VideoChat

    VideoChat

    Real-time voice interactive digital human

    VideoChat is a real-time voice-interactive “digital human” system that combines automatic speech recognition, large language models, text-to-speech, and talking-head generation into a single conversational pipeline. It supports both pure end-to-end voice solutions based on multimodal large language models (GLM-4-Voice feeding directly into talking-head generation) and a more traditional cascaded pipeline using ASR → LLM → TTS → talking head.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    pg-aiguide

    pg-aiguide

    MCP server and Claude plugin for Postgres skills and documentation

    ...It acts as a bridge between database documentation and AI tools, enabling more accurate generation of SQL queries and database interactions. The system integrates with Claude Code through an MCP server, allowing agents to access curated PostgreSQL knowledge in real time. It focuses on improving developer productivity by reducing errors and providing context-aware suggestions. The project also includes documentation and examples that help users understand best practices in database design and querying. It is particularly useful for teams working heavily with PostgreSQL who want to leverage AI tools effectively. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ClawTeam

    ClawTeam

    ClawTeam: Agent Swarm Intelligence (One Command → Full Automation)

    ...Instead of relying on a single agent, the system allows a leader agent to spawn and coordinate multiple specialized sub-agents, each responsible for different aspects of a problem. These agents communicate, share insights, and dynamically adapt their strategies based on real-time feedback, creating a form of collective intelligence. The framework supports a wide range of use cases, including software development, machine learning research, financial analysis, and content production. It is designed to work with various AI tools and command-line agents, making it highly flexible and extensible. ClawTeam also includes monitoring tools such as dashboards and tmux-based views to observe agent activity and progress.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    hls4ml

    hls4ml

    Machine learning on FPGAs using HLS

    ...This approach allows machine learning algorithms to run directly on specialized hardware, making them suitable for applications that require extremely fast response times and minimal power consumption. The framework was originally developed for high-energy physics experiments where real-time decision systems must process large volumes of data with strict latency constraints. Over time, it has expanded to support a variety of scientific and industrial applications including signal processing, embedded systems, and biomedical monitoring.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Orpheus TTS

    Orpheus TTS

    Towards Human-Sounding Speech

    ...Inference is provided through a Python package that uses vLLM under the hood for high-throughput, low-latency generation, including streaming examples that show how to generate audio chunks in real time. The maintainers provide Colab notebooks, a standardized prompting format, and one-click deployment via Baseten for production-grade, FP8/FP16 optimized inference with ~200 ms streaming latency.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    SwarmZero

    SwarmZero

    SwarmZero's SDK for building AI agents, swarms of agents and much more

    SwarmZero is an open-source platform designed for deploying and managing autonomous robot swarms. It enables collective coordination, decentralized decision-making, and real-time collaboration among large groups of autonomous agents, focusing on multi-robot systems and research in swarm robotics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Norfair

    Norfair

    Lightweight Python library for adding real-time multi-object tracking

    Norfair is a customizable lightweight Python library for real-time multi-object tracking. Using Norfair, you can add tracking capabilities to any detector with just a few lines of code. Any detector expressing its detections as a series of (x, y) coordinates can be used with Norfair. This includes detectors performing tasks such as object or keypoint detection. It can easily be inserted into complex video processing pipelines to add tracking to existing projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    AgentScope

    AgentScope

    Build and run agents you can see, understand and trust

    ...It provides essential abstractions that evolve with advancing LLM capabilities, emphasizing reasoning, tool use, and flexible orchestration rather than rigid prompt constraints. With built-in support for ReAct agents, memory, planning, human-in-the-loop control, and real-time voice interaction, developers can create powerful agents in minutes. AgentScope integrates seamlessly with tools, long-term memory systems, MCP, A2A (Agent-to-Agent) protocols, and observability frameworks. It also supports reinforcement learning workflows for tuning agents and improving performance across complex tasks. Deployable locally, serverless in the cloud, or on Kubernetes with OpenTelemetry support, AgentScope is built for both experimentation and production environments.
    Downloads: 3 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Flowly AI

    Flowly AI

    Flowly is 100x faster than OpenClaw

    ...It features a multi-agent architecture where different specialized agents can collaborate, delegate tasks, and operate in parallel. Flowly also includes voice capabilities, enabling real-time phone interactions using speech-to-text and text-to-speech systems. Overall, it provides a powerful, extensible, and privacy-focused alternative to cloud-based AI assistants.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    KaTrain

    KaTrain

    Improve your Baduk skills by training with KataGo

    KaTrain is an advanced training and analysis tool for the board game Go that leverages the powerful KataGo AI engine to provide real-time feedback and in-depth game review capabilities. It is designed to help players of all skill levels improve by identifying mistakes, analyzing move efficiency, and offering alternative strategies based on AI evaluation. The application allows users to play against AI opponents with adjustable difficulty, including intentionally weakened versions of the engine that simulate human-like play styles. ...
    Downloads: 50 This Week
    Last Update:
    See Project
  • 12
    BlenderMCP

    BlenderMCP

    Blender Model Context Protocol Integration

    ...It allows users to control Blender using natural language prompts, effectively turning AI into a co-creator for 3D modeling, scene construction, and asset manipulation. The system establishes a two-way communication channel between Blender and the AI, where commands can be sent and results retrieved in real time. It includes features for object manipulation, material editing, and scene inspection, giving the AI deep control over the modeling environment. The project also supports integration with external asset sources such as Sketchfab and Poly Haven, expanding the range of available resources. Additionally, it allows execution of Python scripts within Blender through AI commands, enabling advanced automation and customization.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    CAG

    CAG

    Cache-Augmented Generation: A Simple, Efficient Alternative to RAG

    CAG, or Cache-Augmented Generation, is an experimental framework that explores an alternative architecture for integrating external knowledge into large language model responses. Traditional retrieval-augmented generation systems rely on real-time retrieval of documents from databases or vector stores during inference. CAG proposes a different approach by preloading relevant knowledge into the model’s context window and precomputing the model’s key-value cache before queries are processed. This strategy allows the model to generate responses using the cached context directly, eliminating the need for repeated retrieval operations during runtime. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    WorldGen

    WorldGen

    Generate Any 3D Scene in Seconds

    ...It supports a wide variety of scenes, including both indoor and outdoor settings, and can handle realistic as well as stylized or fantastical environments. Rendering is decoupled from generation, so you can render at arbitrary resolutions and camera trajectories in real time, which makes it easier to integrate into custom pipelines.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    FireRedTTS-2

    FireRedTTS-2

    Long-form streaming TTS system for multi-speaker dialogue generation

    FireRedTTS2 is a next-generation open-source text-to-speech (TTS) system focused on long-form, streaming speech synthesis for multi-speaker dialogue, delivering stable natural speech with context-aware prosody and reliable speaker transitions that support real-time and conversational applications. It features a specialized streaming speech tokenizer and a dual-transformer architecture that enables low latency and high-quality synthesis, making it suitable for interactive systems like chatbots, podcasts, and applications where dynamic turn-taking between speakers is essential. FireRedTTS2 supports multilingual output and speaker flexibility, enabling scenarios that involve language switching, cross-lingual voice cloning, and expressive dialogue generation that maintains consistency over longer utterances.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Qwen3-ASR

    Qwen3-ASR

    Qwen3-ASR is an open-source series of ASR models

    Qwen3-ASR is an automatic speech recognition system in the QwenLM family, developed to convert spoken language into text with strong accuracy and real-time performance. As a specialized ASR variant of the broader Qwen language model ecosystem, it focuses on capturing reliable transcriptions from audio sources such as recordings, live streams, or conversational inputs while supporting low latency use cases. The architecture combines advanced neural acoustic modeling with context-aware language prediction so that outputs maintain both fidelity to the original speech and grammatical coherence. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AI Runner

    AI Runner

    Offline inference engine for art, real-time voice conversations

    AI Runner is an offline inference engine designed to run a collection of AI workloads on your own machine, including image generation for art, real-time voice conversations, LLM-powered chatbots and automated workflows. It is implemented as a desktop-oriented Python application and emphasizes privacy and self-hosting, allowing users to work with text-to-speech, speech-to-text, text-to-image and multimodal models without sending data to external services. At the core of its LLM stack is a mode-based architecture with specialized “modes” such as Author, Code, Research, QA and General, and a workflow manager that automatically routes user requests to the right agent based on the task. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    NErlNet

    NErlNet

    Nerlnet is a framework for research and development

    NErlNet is a research-grade framework for distributed machine learning over IoT and edge devices. Built with Erlang (Cowboy HTTP), OpenNN, and Python (Flask), it enables simulation of clusters on a single machine or real deployment across heterogeneous devices.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    ...It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 33 This Week
    Last Update:
    See Project
  • 20
    WhisperSpeech

    WhisperSpeech

    An Open Source text-to-speech system built by inverting Whisper

    ...The repository includes notebooks and scripts for inference, long-form synthesis, and finetuning, as well as pre-trained models and converted datasets hosted on Hugging Face. Performance optimizations like torch.compile, KV-caching, and architectural tweaks allow the main model to reach up to 12× real-time speed on a consumer RTX 4090.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Gemini Fullstack LangGraph Quickstart

    Gemini Fullstack LangGraph Quickstart

    Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph

    gemini-fullstack-langgraph-quickstart is a fullstack reference application from Google DeepMind’s Gemini team that demonstrates how to build a research-augmented conversational AI system using LangGraph and Google Gemini models. The project features a React (Vite) frontend and a LangGraph/FastAPI backend designed to work together seamlessly for real-time research and reasoning tasks. The backend agent dynamically generates search queries based on user input, retrieves information via the Google Search API, and performs reflective reasoning to identify knowledge gaps. It then iteratively refines its search until it produces a comprehensive, well-cited answer synthesized by the Gemini model. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    AI Agent Deep Dive

    AI Agent Deep Dive

    AI Agent Source Code Deep Research Report

    ...It explores how agents interact with environments, execute tasks, and maintain context over time, highlighting both strengths and limitations of current approaches. The repository likely includes diagrams, annotated code samples, and conceptual walkthroughs that mirror real production systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Databend

    Databend

    Cloud-native open source data warehouse for analytics and AI queries

    ...Databend provides a unified engine capable of handling analytics, vector search, and full-text search within a single platform. Databend supports SQL-based workflows and enables real-time data ingestion, transformation, and analysis through streaming and task orchestration features. With its cloud-native design and distributed architecture, Databend can run both as a self-hosted system or within managed environments to power data analytics, AI workloads, and large-scale data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    LLM Colosseum

    LLM Colosseum

    Benchmark LLMs by fighting in Street Fighter 3

    ...The system places language models inside the environment of the classic video game Street Fighter III, where they must interpret the game state and decide which actions to perform during combat. This setup creates a dynamic environment that tests reasoning, situational awareness, and decision-making abilities in real time. Instead of relying purely on reward signals as in reinforcement learning agents, the models analyze contextual information and generate strategic actions based on the game environment. Performance is evaluated using a competitive ranking system that assigns models an ELO rating based on their results across matches against other models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    LLM Guard

    LLM Guard

    The Security Toolkit for LLM Interactions

    ...The toolkit also helps prevent sensitive information leaks by identifying secrets such as API keys or credentials before they are processed by the model. LLM Guard supports both input and output filtering pipelines, allowing developers to sanitize prompts and validate generated responses in real time. The library integrates easily with existing AI frameworks and can be deployed in production environments to enhance the security posture of LLM-based applications.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB