Showing 373 open source projects for "visual-cfd"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    VOID

    VOID

    Video Object and Interaction Deletion

    VOID is an advanced AI video processing system developed by Netflix that focuses on removing objects from videos while preserving the physical and visual realism of the surrounding environment. Unlike traditional inpainting methods that only erase pixels or simple artifacts, VOID models the full interaction dynamics between objects and their environment, including shadows, reflections, and even physical consequences such as movement or balance changes. Built on top of transformer-based architectures and fine-tuned for video inpainting tasks, the system uses interaction-aware mask conditioning to ensure temporal consistency across frames. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    GLM-4.6V

    GLM-4.6V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    VMZ (Video Model Zoo)

    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Coze Studio

    Coze Studio

    An AI agent development platform with all-in-one visual tools

    Coze Studio is ByteDance’s open‑source, visual AI agent development platform. It offers no-code/low-code workflows to build, debug, and deploy conversational agents, integrating prompting, RAG-based knowledge bases, plugin systems, and workflow orchestration. Developed in Go (backend) and React/TypeScript (frontend), it uses a containerized microservices architecture suitable for enterprise deployment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Auth0 B2B Essentials: SSO, MFA, and RBAC Built In Icon
    Auth0 B2B Essentials: SSO, MFA, and RBAC Built In

    Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

    Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.
    Sign Up Free
  • 5
    OpenWhip

    OpenWhip

    Optimize interaction with AI coding assistants

    OpenWhip is a desktop utility built as a cross-platform Node.js application that humorously gamifies interaction with AI coding assistants by simulating a “whip” tool to interrupt and motivate them during long or stalled operations. The application runs as a lightweight system tray program and overlays a visual whip animation on the screen when activated, creating an interactive and slightly absurd interface for user engagement. Its core functionality is surprisingly practical beneath the joke: when triggered, it sends a keyboard interrupt signal (Ctrl+C) to halt the current AI process, effectively giving developers a quick way to stop unresponsive or slow-running tasks. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    StoryGen Atelier

    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    StoryGen Atelier is an advanced creative tool that blends AI with visual storytelling, making it possible to generate fully structured storyboards and stitched videos from text prompts without requiring manual art or animation skills. Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    ChainForge

    ChainForge

    An open-source visual programming environment

    ChainForge is an open-source visual programming environment designed to help developers systematically test, compare, and evaluate prompts and outputs across multiple large language models in a structured and scalable way. Instead of relying on isolated prompt experimentation, it introduces a dataflow-based interface that allows users to create complex prompt pipelines and evaluate them across different models, parameters, and datasets simultaneously.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    DeepWiki Open

    DeepWiki Open

    AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories

    ...Users can enter a repository URL and the system will clone the project, build semantic embeddings of its codebase, extract architecture and relationships, generate human-readable documentation, and produce visual diagrams to help explain complex code structure. DeepWiki’s output turns raw repositories into interactive, web-style wikis complete with navigable sections, diagrams, and contextual explanations, making it easier for developers and collaborators to understand unfamiliar code. It includes an “Ask” feature that lets users query the generated wiki using RAG-style retrieval, enabling interactive question-answering and exploration.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    ClaudeBar

    ClaudeBar

    A macOS menu bar application that monitors AI coding assistant usage

    ...Rather than constantly running CLI commands or navigating web dashboards, users can glance at their quota statistics for services like Claude, Codex, Gemini, GitHub Copilot, and Antigravity directly from the menu bar. The application provides real-time tracking of session, weekly, and model-specific usage percentages, using visual indicators such as color-coded progress bars to communicate when quotas are healthy, nearing limits, or depleted. It includes options to enable or disable monitoring for individual providers, supports multiple visual themes (including dark mode and a festive theme), and refreshes data at configurable intervals so users always have up-to-date information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    VideoRAG

    VideoRAG

    "VideoRAG: Chat with Your Videos

    VideoRAG is a retrieval-augmented generation (RAG) framework tailored for video content that enables AI systems to answer questions, summarize, and reason over long videos by combining visual embeddings with contextual search. The system works by first breaking video into clips, extracting visual and audio-textual features, and indexing them into embeddings, then using an LLM with a retriever to pull relevant segments on demand. When a user query is received, VideoRAG locates semantically relevant moments in the video using the embedding index, retrieves associated clips or transcripts, and feeds them to a generative model to produce accurate, grounded answers or summaries. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    DeepSeek VL2

    DeepSeek VL2

    Mixture-of-Experts Vision-Language Models for Advanced Multimodal

    ...or “Generate a caption appropriate to context”). The model supports both image understanding (vision tasks) and multimodal reasoning, and is likely used as a component in agent systems to process visual inputs as context for downstream tasks. The repository includes evaluation results (e.g. image/text alignment scores, common VL benchmarks), configuration files, and model weights (where permitted). While the internal architecture details are not fully documented publicly, the repo suggests that VL2 introduces enhancements over prior vision-language models (e.g. better scaling, cross-modal attention, more robust alignment) to improve grounding and multimodal understanding.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Midscene

    Midscene

    Vision-based AI framework for cross-platform UI automation tasks

    Midscene.js is an open source AI-driven UI automation framework designed to control user interfaces across multiple platforms using natural language instructions. Instead of relying on traditional selectors, DOM structures, or accessibility attributes, it uses a vision-first approach where screenshots are analyzed by visual-language models to identify interface elements and perform actions. It allows developers to automate interactions on web applications, desktop software, and mobile devices without needing platform-specific automation logic. Developers can describe tasks such as clicking buttons, filling forms, or extracting information, and the system interprets these commands to interact with the interface accordingly. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    AliceVision

    AliceVision

    3D Computer Vision Framework

    ...The framework is built with a strong emphasis on research-grade algorithms while maintaining the robustness required for production environments, making it suitable for industries such as visual effects, cultural heritage preservation, and robotics. AliceVision is modular, enabling developers to use individual components or customize the pipeline for specific workflows, including panorama stitching and camera tracking. It integrates with tools like Meshroom, which offers a graphical interface to simplify complex reconstruction processes for non-technical users.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    UFO³

    UFO³

    Weaving the Digital Agent Galaxy

    ...The system allows users to issue natural language instructions that are translated into automated actions across multiple desktop applications. Using a dual-agent architecture, the framework analyzes both visual interface elements and system control structures in order to understand how applications should be manipulated. This enables the agent to navigate complex software environments and perform tasks that normally require manual interaction. UFO integrates mechanisms for task decomposition, planning, and execution so that high-level user requests can be broken down into smaller steps performed by specialized agents. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Clawra

    Clawra

    Openclaw as your girlfriend

    ...Rather than being a static chatbot tied to a corporate ecosystem, Clawra runs locally or on a private server, giving users full control over the software and data that back her behavior. She is designed not just to answer questions but to maintain a persistent character with memory, backstory, and the ability to present visual outputs like generated selfies through integrated image tools, blending conversational AI with a playful persona. Clawra has captured attention as an experimental project showcasing how far open-source agents can be pushed in creating engaging and personalized interactions, with community interest spiking around her capabilities.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Depth Anything 3

    Depth Anything 3

    Recovering the Visual Space from Any Views

    Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity. The model can be applied to photography, AR/VR content creation, robotics perception, and 3D reconstruction workflows, making it versatile across industries and research domains. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 17
    FormCreate

    FormCreate

    The easy-to-use Vue low-code visual AI form designer

    FormCreate is a low-code visual form builder built on Vue that enables developers to create complex, dynamic forms through a drag-and-drop interface rather than manual coding. It is part of the broader form-create ecosystem and leverages JSON-based schema generation to dynamically render forms, handle validation, and manage data collection workflows. The tool is designed to significantly reduce development time by allowing users to visually assemble forms while automatically generating the underlying configuration and logic. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Refly

    Refly

    The first open-source agent skills builder

    Refly is an AI-native workflow platform that democratizes automated workflow and skills creation for both technical and non-technical users by offering a visual, natural-language-driven interface. Instead of requiring code, Refly lets creators define tasks and business logic through simple “vibes,” which are compiled into structured, reusable agent skills that can be executed on engines like Claude Code, Cursor, or other supported runtimes. With a focus on making automation accessible, it provides a visual canvas and low-code components that feel similar to drag-and-drop builders but backed by powerful AI orchestration, memory handling, and integrations with external services. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    agentation

    agentation

    The visual feedback tool for agents

    Agentation is a visual annotation and feedback tool designed to make interacting with AI coding agents more intuitive and precise by letting developers visually click on frontend elements in a browser and annotate them with context before sending structured feedback to an agent. Instead of describing UI elements in text — like “the blue button in the sidebar” — users click directly on elements to automatically capture selectors, positions, and contextual metadata that can be consumed by AI agents to locate exact code references. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Dafthunk

    Dafthunk

    A workflow execution platform built on top of the fantastic Cloudflare

    ...It aims to combine the approachability of a visual editor with the practical needs of real automation: state persistence, execution history, reusable nodes, and integrations with external systems. A key appeal is that you can go from idea to running automation quickly in a hosted-like experience while still keeping the project open source and infrastructure-aware.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    X-AnyLabeling

    X-AnyLabeling

    Effortless data labeling with AI support from Segment Anything

    ...It supports labeling tasks across images and videos and enables developers to prepare training datasets for tasks such as object detection, segmentation, classification, tracking, and pose estimation. The tool is built with an interactive graphical interface that simplifies annotation workflows and allows users to draw and edit labels directly on visual data. It also supports a wide range of export formats compatible with popular machine learning pipelines, making it easier to integrate with training frameworks.
    Downloads: 40 This Week
    Last Update:
    See Project
  • 22
    HunyuanWorld 1.0

    HunyuanWorld 1.0

    Generating Immersive, Explorable, and Interactive 3D Worlds

    ...The architecture integrates panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to produce high-quality scene-scale 3D worlds from both text and images. HunyuanWorld-1.0 surpasses existing open-source methods in visual quality and geometric consistency, demonstrated by superior scores in BRISQUE, NIQE, Q-Align, and CLIP metrics.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    InternVL

    InternVL

    A Pioneering Open-Source Alternative to GPT-4o

    InternVL is a large-scale multimodal foundation model designed to integrate computer vision and language understanding within a unified architecture. The project focuses on scaling vision models and aligning them with large language models so that they can perform tasks involving both visual and textual information. InternVL is trained on massive collections of image-text data, enabling it to learn representations that capture both visual patterns and semantic meaning. The model supports a wide variety of tasks, including visual perception, image classification, and cross-modal retrieval between images and text. It can also be connected to language models to enable conversational interfaces that understand images, videos, and other visual content. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    AionUi

    AionUi

    Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

    AionUi is an open-source, cross-platform graphical interface that turns command-line AI tools into a unified coworking desktop for interacting with multiple local AI agents and CLI models like Gemini CLI, Claude Code, Codex, Qwen Code, and others. Instead of forcing users to work in separate terminals for each tool, AionUi automatically detects installed CLI tools and provides a central visual workspace where sessions can run in parallel, contexts are preserved, and conversations are saved locally without sending data to external servers. It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. ...
    Downloads: 46 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB