Showing 373 open source projects for "visual-cfd"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    AdalFlow

    AdalFlow

    The library to build & auto-optimize LLM applications

    AdalFlow is a framework for building AI-powered automation workflows, enabling users to design and execute intelligent automation pipelines with minimal coding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Claw3D

    Claw3D

    Claw3D is an open source 3D engine built on OpenClaw

    ...Instead of relying on traditional dashboards or logs, Claw3D introduces a spatial interface that allows users to navigate through a simulated office and watch agents collaborate, effectively turning abstract processes into tangible visual interactions. The system supports task assignment, progress tracking, and communication between agents, creating a representation of autonomous or semi-autonomous workflows. It can be self-hosted, giving users full control over deployment, customization, and scaling of their AI workspace.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    PyTorch3D

    PyTorch3D

    PyTorch3D is FAIR's library of reusable components for deep learning

    ...The library provides fast GPU-accelerated implementations of rendering pipelines, transformations, rasterization, and lighting—making it possible to compute gradients through full 3D rendering processes. Researchers use it for tasks like shape generation, reconstruction, view synthesis, and visual reasoning. PyTorch3D also includes utilities for loading, transforming, and sampling 3D assets, so models can be trained end-to-end from 2D supervision or partial data. Its modular design allows easy extension—components like differentiable rasterizers, mesh blending, or signed distance field (SDF) modules can be swapped or combined to test new architectures quickly.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    llama.vscode

    llama.vscode

    VS Code extension for LLM-assisted code/text completion

    llama.vscode is a Visual Studio Code extension that provides AI-assisted coding features powered primarily by locally running language models. The extension is designed to be lightweight and efficient, enabling developers to use AI tools even on consumer-grade hardware. It integrates with the llama.cpp runtime to run language models locally, eliminating the need to rely entirely on external APIs or cloud providers.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    Kanwas

    Kanwas

    Shared context board for teams and agents

    ...It gives people and agents a common canvas where documents, evidence, decisions, notes, tasks, embeds, and outputs can live side by side. Instead of scattering context across chats, documents, and disconnected tools, Kanwas turns messy collaborative work into a shared visual environment that both humans and agents can read and update. The platform supports real-time collaboration, visible agent tool calls, and a timeline that makes AI activity easier to follow and audit. It is designed for long-running projects where maintaining context matters as much as producing a single response. Overall, Kanwas positions itself as a multiplayer thinking space for AI-assisted research, planning, and execution.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Phi-3-MLX

    Phi-3-MLX

    Phi-3.5 for Mac: Locally-run Vision and Language Models

    Phi-3-Vision-MLX is an Apple MLX (machine learning on Apple silicon) implementation of Phi-3 Vision, a lightweight multi-modal model designed for vision and language tasks. It focuses on running vision-language AI efficiently on Apple hardware like M1 and M2 chips.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Browser Use MCP Server

    Browser Use MCP Server

    Browse the web, directly from Cursor etc.

    A browser automation server implementing the Model Context Protocol, designed to allow AI assistants to browse the web directly from applications like Cursor. It supports natural language commands for web navigation and interaction. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    JAI Workflow

    JAI Workflow

    Build programmatically custom agentic workflows, AI Agents, RAG system

    JAI-Workflow is a framework for building and managing machine learning workflows, streamlining the process from data ingestion to model deployment.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Framelink MCP for Figma

    Framelink MCP for Figma

    MCP server enabling AI coding tools to access Figma design data

    Figma-Context-MCP is an open source server that connects Figma design data with AI-powered coding tools through the Model Context Protocol (MCP). It allows coding assistants to retrieve structured information from Figma files so they can better translate visual designs into working code. Instead of relying on screenshots or manual descriptions, Figma-Context-MCP accesses layout, styling, and component metadata directly from the Figma API and presents it in a simplified format optimized for AI models. This transformation reduces unnecessary metadata and focuses on the most relevant design attributes, helping AI coding agents produce more accurate UI implementations. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 10
    Android Use

    Android Use

    Automate native Android apps with AI using accessibility APIs

    android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work; such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    OpenSwarm

    OpenSwarm

    Claude code for everything except coding

    ...The included agents can handle research, data analysis, slide decks, documents, images, videos, scheduling, messaging, and other productivity tasks. It is designed for outputs like pitch decks, market research, SEO content, quarterly reports, launch campaigns, visual assets, and multimedia projects. The project can connect to external services through integrations and can be customized into purpose-specific swarms for areas such as SEO, sales, marketing, finance, customer support, or research. Its main appeal is giving technical users a forkable, terminal-based framework for building agent teams that produce polished business and creative deliverables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Computer Vision in Action

    Computer Vision in Action

    A computer vision closed-loop learning platform

    ...It serves as a hands-on companion for learners and engineers who want to understand not just the theory, but how computer vision is actually implemented for tasks like object detection, image classification, feature tracking, optical flow, and image segmentation. The repository includes structured code examples, scripts, and notebooks that cover pipeline construction, preprocessing, model inference, and visual output rendering, making it easy for newcomers or intermediate practitioners to adapt patterns to their own projects. It also explores how to combine classical computer vision techniques with modern neural network-based models, offering insight into when each approach is most effective.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    VGGT

    VGGT

    [CVPR 2025 Best Paper Award] VGGT

    VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Elyra

    Elyra

    Elyra extends JupyterLab with an AI centric approach

    Elyra is a set of AI-centric extensions to JupyterLab Notebooks. The Elyra Getting Started Guide includes more details on these features. A version-specific summary of new features is located on the releases page.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DeepSeek VL

    DeepSeek VL

    Towards Real-World Vision-Language Understanding

    DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository includes model weights (or pointers to them), evaluation metrics on standard vision + language benchmarks, and configuration or architecture files. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    SuperDesign

    SuperDesign

    AI Product Design Agent

    ...With a single prompt, SuperDesign can produce full UI mockups, reusable components, and wireframes, dramatically speeding up the early stages of product creation and iteration. Beyond simple visual generation, it lets users fork and refine designs, supporting a dynamic workflow where ideas evolve quickly within the context of your project. This tool is designed to support modularity and consistency, helping teams maintain a coherent design system without manually crafting every element. Because it is open source, anyone can inspect, extend, and customize the agent to fit specific workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Claude Code Video Vision

    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    ...Instead of attempting to directly interpret raw video streams, the system extracts key frames using tools like ffmpeg and processes audio through transcription engines, converting both visual and auditory signals into structured inputs for the model. The result is a perception layer that feeds images and timestamped transcripts into Claude, allowing it to analyze events, answer questions, and summarize content with contextual awareness. The system dynamically adapts how much data it extracts based on the user’s query, adjusting frame rate, resolution, and time windows to optimize both performance and token efficiency. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    OSWorld

    OSWorld

    Benchmarking Multimodal Agents for Open-Ended Tasks

    ...It provides a richly simulated 3D world where multiple agents can interact, perform tasks, and learn complex behaviors. OSWorld emphasizes multi-modal interaction, enabling agents to process visual, auditory, and symbolic data for grounded learning in a simulated world.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Learning Interpretability Tool

    Learning Interpretability Tool

    Interactively analyze ML models to understand their behavior

    The Learning Interpretability Tool (LIT, formerly known as the Language Interpretability Tool) is a visual, interactive ML model-understanding tool that supports text, image, and tabular data. It can be run as a standalone server, or inside of notebook environments such as Colab, Jupyter, and Google Cloud Vertex AI notebooks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TorchAudio

    TorchAudio

    Data manipulation and transformation for audio signal processing

    The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    AI-Codereview-Gitlab

    AI-Codereview-Gitlab

    GitLab automatic code review tool based on large models

    AI-Codereview-Gitlab is an open-source automation tool that integrates large language models into the GitLab development workflow to perform automated code reviews. The system monitors GitLab repositories and analyzes commits or merge requests using AI models to identify potential issues, coding mistakes, and quality improvements before the code is merged. By leveraging multiple large language model providers—including OpenAI, DeepSeek, ZhipuAI, or local models through Ollama—the platform...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    OpenPromptStudio

    OpenPromptStudio

    Visual editor for AI prompts with translation, categories, and tools

    OpenPromptStudio is an open source visual editor designed to help users create, organize, and manage prompts for AI image generation tools. It focuses on improving the workflow for building prompts by turning them into structured, visual components that are easier to edit and rearrange. It supports the creation and classification of prompt segments, allowing users to organize them into different types such as styles, quality modifiers, commands, or general prompt elements. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Nexent

    Nexent

    Zero-code platform for building AI agents from natural language input

    Nexent is an open source platform designed to enable users to create intelligent agents using natural language instead of traditional programming or visual orchestration tools. It focuses on a zero-code approach, allowing users to define workflows and agent behavior purely through language prompts, significantly lowering the barrier to entry for AI development. Built on the MCP ecosystem, Nexent integrates a wide range of tools, models, and data sources into a unified environment for agent creation and execution. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Micro Agent

    Micro Agent

    AI CLI agent that writes code by iterating until tests pass

    Micro Agent is a command-line tool designed to generate and refine code using a test-driven approach powered by large language models. Instead of producing one-shot code outputs, it creates or uses test cases and repeatedly iterates on the generated code until those tests pass successfully. This workflow emphasizes reliability by using structured feedback from failing tests to guide improvements, reducing the need for manual debugging and iteration. Micro Agent intentionally limits its scope...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    FireRed-Image-Edit

    FireRed-Image-Edit

    General-purpose image editing model that delivers high-fidelity

    ...It is built on a flexible text-to-image foundation model that has been extended with training paradigms including pretraining, supervised fine-tuning, and reinforcement learning to imbue the system with strong instruction following and editing consistency. The model excels in maintaining visual and text stylistic fidelity, allowing users to preserve the original artistic qualities of an image while applying creative changes according to natural language instructions. In addition to editing single images, FireRed supports multi-image editing scenarios such as virtual try-on or batch transformations, making it suitable for both creative and practical workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB