Search Results for "visual-cfd" - Page 2

Showing 501 open source projects for "visual-cfd"

Filter: Python

1

RobotCode

RobotFramework support for Visual Studio Code

An extension that brings support for RobotFramework to Visual Studio Code, including features like code completion, debugging, test explorer, refactoring and more! With RobotCode you can edit your code with auto-completion, code navigation, syntax checking, and much more.

Downloads: 2 This Week

Last Update: 2026-04-01
See Project
2

WiFi DensePose

Turn WiFi signals into real-time human pose estimation and detection

...The repository includes components for data processing, model inference, and real-time visualization, making it suitable for research and experimental deployments. Its architecture emphasizes performance and reproducibility, allowing developers to explore non-visual motion capture systems using accessible hardware. Overall, WiFi DensePose functions as an advanced research-grade toolkit for WiFi-based human sensing and pose estimation.

Downloads: 120 This Week

Last Update: 2026-05-14
See Project
3

Self-Operating Computer

A framework to enable multimodal models to operate a computer

...Notably, it was the first known project to implement a multimodal model capable of viewing and controlling a computer screen. The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.

1 Review

Downloads: 8 This Week

Last Update: 2025-02-28
See Project
4

Flowsint

Graph-based OSINT investigation platform with visual relationship mapping

Flowsint is an open source OSINT investigation platform designed to help analysts explore and understand relationships between digital entities through a visual graph interface. The platform focuses on reconnaissance and open source intelligence workflows, enabling investigators to map connections between domains, IP addresses, organizations, individuals, and other data points. By presenting these relationships in an interactive graph, Flowsint allows users to quickly identify patterns, associations, and investigative leads that might be difficult to detect through traditional data analysis methods. ...

Downloads: 3 This Week

Last Update: 2026-04-11
See Project
5

PySpur

Visual tool for building, testing, and deploying AI agent workflows

...By offering a visual representation of workflows, PySpur makes it easier to debug interactions between components and identify failures in complex pipelines. It supports iterative experimentation, allowing developers to rapidly improve agents without rebuilding systems from scratch. PySpur also enables deployment of finalized workflows after testing, making it suitable for both development and production use.

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
6

CogVLM

A state-of-the-art open visual language model

CogVLM is an open-source visual–language model suite—and its GUI-oriented sibling CogAgent—aimed at image understanding, grounding, and multi-turn dialogue, with optional agent actions on real UI screenshots. The flagship CogVLM-17B combines ~10B visual parameters with ~7B language parameters and supports 490×490 inputs; CogAgent-18B extends this to 1120×1120 and adds plan/next-action outputs plus grounded operation coordinates for GUI tasks.

Downloads: 0 This Week

Last Update: 4 days ago
See Project
7

NVIDIA AI Blueprint

Suite of reference architectures for building GPU-accelerated vision

...The project is organized around real-time video intelligence, downstream analytics, and agentic offline processing. It supports workflows such as natural-language video search, visual question answering, long-video summarization, clip retrieval, verified alerts, and incident analysis. It is designed for technical users who need deployable reference architectures for smart spaces, warehouse automation, SOP validation, monitoring, and operational video analytics. The repository includes Python agent code, Docker Compose deployment configurations, skills, scripts, and a Next.js-based UI.

Downloads: 6 This Week

Last Update: 6 days ago
See Project
8

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs

...It incorporates modern 3D generation technologies including neural radiance fields, Gaussian splatting, and other AI-driven reconstruction techniques. Through these nodes, users can convert images into 3D models, manipulate geometry, and experiment with generative 3D workflows inside the visual pipeline editor.

Downloads: 2 This Week

Last Update: 2026-03-11
See Project
9

StarVector

StarVector is a foundation model for SVG generation

...The system treats vector graphics creation as a code generation problem, producing SVG code that can render detailed vector images. Its architecture combines computer vision techniques with language modeling capabilities so it can understand visual inputs and textual prompts simultaneously. The model converts raster images or text instructions into structured vector representations, enabling high-quality vectorization and design generation. This approach allows StarVector to create scalable graphics that maintain visual quality regardless of resolution, which is especially useful for design tools and illustration workflows. ...

Downloads: 2 This Week

Last Update: 2026-03-05
See Project
10

FEATool Multiphysics - FEA & CFD Toolbox

FEATool Multiphysics is an easy-to-use FEA and CFD Simulation Toolbox

...In addition to built-in CAD and pre/post-processing, automatic mesh generation, and multi-physics solvers, FEATool also seamlessly integrates the state-of-the-art OpenFOAM [1], SU2, and FEniCS CFD and FEA solvers. Moreover, native integration with MATLAB and Python allows advanced simulation scripts and programmatic use cases such as in AI and machine learning [2]. [1]: https://www.featool.com/Easy-to-Use-OpenFOAM-GUI/ [2]: https://featool.com/model-showcase/2024-11-12-CFD-and-Flow-Prediction-Based-on-Deep-Learning

4 Reviews

Downloads: 10 This Week

Last Update: 2026-02-07
See Project
11

LTX-2.3

Official Python inference and LoRA trainer package

...Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. This unified approach allows creators to generate complete multimedia sequences where motion, timing, and sound are aligned automatically. LTX-2 is designed for both research and production workflows and can generate high-resolution video clips with precise control over structure, motion, and camera behavior.

Downloads: 95 This Week

Last Update: 2026-05-11
See Project
12

Book2_Beauty-of-Data-Visualization

Machine Learning, Criticism and Correction

Book2_Beauty-of-Data-Visualization is an open educational project that teaches the principles and techniques of effective data visualization using Python and modern plotting libraries. The repository focuses on both the technical and aesthetic aspects of visual analytics, helping learners understand how to communicate data clearly and persuasively. It includes practical examples that demonstrate how different chart types reveal patterns, trends, and distributions in real datasets. The material emphasizes visual storytelling and design thinking alongside coding implementation. By combining theory with hands-on plotting exercises, the book helps readers build both analytical and presentation skills. ...

Downloads: 0 This Week

Last Update: 2026-05-01
See Project
13

Watermark-Removal

Machine learning image inpainting task that removes watermarks

The Watermark-Removal repository is a machine learning project focused on removing visible watermarks from digital images using deep learning and image inpainting techniques. The system analyzes an image containing a watermark and attempts to reconstruct the underlying visual content so that the watermark is removed while preserving the original appearance of the image. The project uses neural network models inspired by research in contextual attention and gated convolution, which are methods commonly applied to image restoration tasks. Through these techniques, the model learns to identify regions of the image affected by the watermark and generate realistic replacements for the missing visual information. ...

Downloads: 5 This Week

Last Update: 2026-03-11
See Project
14

video-use

Edit videos with Claude Code

...Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of content types, including interviews, tutorials, montages, and talking-head videos. By combining structured text representations with on-demand visual previews, it minimizes processing overhead while maintaining high-quality results. Overall, Video Use reimagines video editing as an AI-driven, conversational workflow.

Downloads: 12 This Week

Last Update: 6 days ago
See Project
15

LLM Vision

Visual intelligence for your home.

...The project enables Home Assistant to analyze images, video files, and live camera feeds using vision-capable AI models. Instead of relying only on traditional object detection pipelines, it allows users to send prompts about visual content and receive contextual descriptions or answers about what is happening in camera footage. The system can process events from surveillance platforms such as Frigate and convert them into meaningful summaries, notifications, or structured data for automation workflows. It also maintains a timeline of analyzed camera events that can be displayed in dashboards or queried through the assistant interface.

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
16

VOID

Video Object and Interaction Deletion

VOID is an advanced AI video processing system developed by Netflix that focuses on removing objects from videos while preserving the physical and visual realism of the surrounding environment. Unlike traditional inpainting methods that only erase pixels or simple artifacts, VOID models the full interaction dynamics between objects and their environment, including shadows, reflections, and even physical consequences such as movement or balance changes. Built on top of transformer-based architectures and fine-tuned for video inpainting tasks, the system uses interaction-aware mask conditioning to ensure temporal consistency across frames. ...

Downloads: 6 This Week

Last Update: 2026-05-04
See Project
17

Minegrub

A Grub Theme in the style of Minecraft!

A Grub theme inspired by Minecraft that adds visual enhancements to the bootloader.

Downloads: 7 This Week

Last Update: 2024-11-18
See Project
18

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...

Downloads: 4 This Week

Last Update: 5 days ago
See Project
19

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.

Downloads: 5 This Week

Last Update: 7 days ago
See Project
20

Book4_Power-of-Matrix

Book_4_Matrix Power | The Iris Book: From Addition, Subtraction

Book4_Power-of-Matrix is an open educational repository that forms part of the Visualize-ML book series, focusing on explaining matrix mathematics and linear algebra concepts through visual and intuitive methods. The project is designed to help readers progress from basic arithmetic toward machine learning fundamentals by building a strong conceptual understanding of vectors, matrices, and their operations. It combines explanatory text, diagrams, and Python examples to bridge theory and practical computation. The material emphasizes geometric interpretation and visual reasoning, which makes abstract linear algebra topics more accessible to beginners and self-learners. ...

Downloads: 0 This Week

Last Update: 2026-05-01
See Project
21

VideoRAG

VideoRAG: Chat with Your Videos

VideoRAG is a retrieval-augmented generation (RAG) framework tailored for video content that enables AI systems to answer questions, summarize, and reason over long videos by combining visual embeddings with contextual search. The system works by first breaking video into clips, extracting visual and audio-textual features, and indexing them into embeddings, then using an LLM with a retriever to pull relevant segments on demand. When a user query is received, VideoRAG locates semantically relevant moments in the video using the embedding index, retrieves associated clips or transcripts, and feeds them to a generative model to produce accurate, grounded answers or summaries. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
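
The retrieval step described in the VideoRAG entry above (index clip embeddings, then pull the most relevant clips for a query) can be sketched roughly as follows. This is a minimal illustration, not VideoRAG's actual code or API: the function names `cosine` and `retrieve`, the clip IDs, and the three-dimensional toy vectors are all hypothetical, and a real system would obtain embeddings from a trained visual or text encoder.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def retrieve(query_vec, clip_index, top_k=2):
    """Return the IDs of the top_k clips most similar to the query embedding."""
    scored = [(cosine(query_vec, vec), clip_id) for clip_id, vec in clip_index.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [clip_id for _, clip_id in scored[:top_k]]

# Hypothetical index: clip ID -> toy embedding (stand-ins for encoder outputs).
clip_index = {
    "clip_000": [1.0, 0.0, 0.0],
    "clip_001": [0.0, 1.0, 0.0],
    "clip_002": [0.7, 0.7, 0.0],
}

# Hypothetical query embedding; the retrieved clips would then be passed,
# along with their transcripts, to a generative model as context.
top = retrieve([0.9, 0.1, 0.0], clip_index, top_k=2)
print(top)
```

In this toy setup the query lies closest to `clip_000`, with `clip_002` as the runner-up, so those two clips would be handed to the language model.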
22

DeepWiki Open

AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories

...Users can enter a repository URL and the system will clone the project, build semantic embeddings of its codebase, extract architecture and relationships, generate human-readable documentation, and produce visual diagrams to help explain complex code structure. DeepWiki’s output turns raw repositories into interactive, web-style wikis complete with navigable sections, diagrams, and contextual explanations, making it easier for developers and collaborators to understand unfamiliar code. It includes an “Ask” feature that lets users query the generated wiki using RAG-style retrieval, enabling interactive question-answering and exploration.

Downloads: 4 This Week

Last Update: 5 days ago
See Project
23

DeepSeek VL2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal

...or “Generate a caption appropriate to context”). The model supports both image understanding (vision tasks) and multimodal reasoning, and is likely used as a component in agent systems to process visual inputs as context for downstream tasks. The repository includes evaluation results (e.g. image/text alignment scores, common VL benchmarks), configuration files, and model weights (where permitted). While the internal architecture details are not fully documented publicly, the repo suggests that VL2 introduces enhancements over prior vision-language models (e.g. better scaling, cross-modal attention, more robust alignment) to improve grounding and multimodal understanding.

Downloads: 7 This Week

Last Update: 2025-10-03
See Project
24

UFO³

Weaving the Digital Agent Galaxy

...The system allows users to issue natural language instructions that are translated into automated actions across multiple desktop applications. Using a dual-agent architecture, the framework analyzes both visual interface elements and system control structures in order to understand how applications should be manipulated. This enables the agent to navigate complex software environments and perform tasks that normally require manual interaction. UFO integrates mechanisms for task decomposition, planning, and execution so that high-level user requests can be broken down into smaller steps performed by specialized agents. ...

Downloads: 3 This Week

Last Update: 6 days ago
See Project
25

Depth Anything 3

Recovering the Visual Space from Any Views

Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity. The model can be applied to photography, AR/VR content creation, robotics perception, and 3D reconstruction workflows, making it versatile across industries and research domains. ...

Downloads: 3 This Week

Last Update: 2026-03-21
See Project