Page 4 | visual-cfd free download

Showing 373 open source projects for "visual-cfd"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
1

Model Explorer

A modern model graph visualizer and debugger

Model Explorer is a visual tool for exploring, debugging, and optimizing ML models deployed on edge devices. Developed by Google AI Edge, it offers a browser-based interface to inspect layer-wise performance, memory usage, and inference timing of TensorFlow Lite and other supported models. It’s a powerful utility for developers optimizing models for constrained environments.

Downloads: 0 This Week

Last Update: 2026-02-09
See Project
2

Audiblez

Generate audiobooks from e-books

Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...

Downloads: 48 This Week

Last Update: 2025-11-30
See Project
3

RuoYi AI

Enterprise AI platform for building, deploying, and managing apps

...RuoYi AI includes built-in support for retrieval-augmented generation, enabling organizations to create secure, private knowledge bases with high-accuracy search and reasoning capabilities. It also offers visual workflow orchestration tools that allow users to design complex AI pipelines, automate tasks, and coordinate multi-agent systems for advanced decision-making scenarios. In addition to backend capabilities, RuoYi AI includes frontend components and administrative dashboards built with modern web technologies, making it a complete end-to-end solution.

Downloads: 4 This Week

Last Update: 2026-04-13
See Project
4

DriveLM

Driving with Graph Visual Question Answering

DriveLM is a research-oriented framework and dataset designed to explore how vision-language models can be integrated into autonomous driving systems. The project introduces a new paradigm called graph visual question answering that structures reasoning about driving scenes through interconnected tasks such as perception, prediction, planning, and motion control. Instead of treating autonomous driving as a purely sensor-driven pipeline, DriveLM frames it as a reasoning problem where models answer structured questions about the environment to guide decision making. ...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
Streamline Azure Security with Palo Alto Networks VM-Series
Centrally manage physical and virtualized firewalls with Panorama

Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.

Learn more
5

swark.io

Create architecture diagrams from code automatically using LLMs

Swark is an open-source developer tool and Visual Studio Code extension that automatically generates software architecture diagrams directly from source code using large language models. The project aims to help developers quickly understand complex codebases by analyzing repositories and producing visual diagrams that represent system architecture, dependencies, and component relationships.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
6

LlamaGen

Autoregressive Model Beats Diffusion

LlamaGen is an open-source research project that introduces a new approach to image generation by applying the autoregressive next-token prediction paradigm used in large language models to visual generation tasks. Instead of relying on diffusion models, the framework treats images as sequences of tokens that can be generated progressively using transformer architectures similar to those used for text generation. The project explores how scaling autoregressive models and improving image tokenization techniques can produce competitive results compared with modern diffusion-based image generators. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
7

LandPPT

An LLM-based presentation generation platform

...The application integrates multiple AI models from providers such as OpenAI, Anthropic, Google, and locally hosted models to generate text, images, and structured presentation layouts. It also includes template systems and style options that allow presentations to be customized for different industries, visual themes, or storytelling formats.

Downloads: 2 This Week

Last Update: 6 days ago
See Project
8

LISA

LISA: Reasoning Segmentation via Large Language Model

...The project introduces a framework where a large language model can interpret natural language instructions and produce segmentation masks that highlight relevant regions in an image. Instead of relying solely on predefined object categories, the model is capable of reasoning about complex textual queries and translating them into visual segmentation outputs. This approach allows the system to identify objects or regions in images based on semantic descriptions, contextual reasoning, and world knowledge. The model integrates multimodal capabilities by combining language understanding with visual perception so that text instructions guide the segmentation process. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
9

GLM-4.5V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding, and long-document interpretation. GLM-4.5V emerged from a training framework that leverages scalable reinforcement learning (with curriculum sampling) to boost performance across tasks ranging from STEM problem solving to long-context reasoning, giving it broad applicability beyond narrow benchmarks. ...

Downloads: 1 This Week

Last Update: 5 days ago
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

FastVLM

This repository contains the official implementation of FastVLM

...The repository documents model variants, showcases head-to-head numbers against known baselines, and explains how the encoder integrates with common LLM backbones. Apple’s research brief frames FastVLM as targeting real-time or latency-sensitive scenarios, where lowering visual token pressure is critical to interactive UX. In short, it’s a practical recipe to make VLMs fast without exotic token-selection heuristics.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
11

ML Ferret

Refer and Ground Anything Anywhere at Any Granularity

Ferret is Apple’s end-to-end multimodal large language model designed specifically for flexible referring and grounding: it can understand references of any granularity (boxes, points, free-form regions) and then ground open-vocabulary descriptions back onto the image. The core idea is a hybrid region representation that mixes discrete coordinates with continuous visual features, so the model can fluidly handle “any-form” referring while maintaining precise spatial localization. The repo presents the vision-language pipeline, model assets, and paper resources that show how Ferret answers questions, follows instructions, and returns grounded outputs rather than just text. In practice, this enables tasks like “find that small red icon next to the chart and describe it” where both the linguistic reference and the visual region are ambiguous without fine spatial reasoning.

Downloads: 0 This Week

Last Update: 2025-10-08
See Project
12

HunyuanVideo-Foley

Multimodal Diffusion with Representation Alignment

HunyuanVideo-Foley is a multimodal diffusion model from Tencent Hunyuan for high-fidelity Foley (sound effects) audio generation synchronized to video scenes. It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks. Produces high-quality 48 kHz audio output suitable for professional use. Hybrid architecture combining multimodal transformer blocks and unimodal refinement blocks. ...

Downloads: 2 This Week

Last Update: 2025-09-28
See Project
13

FastGPT

FastGPT is a knowledge-based platform built on the LLMs

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

Downloads: 4 This Week

Last Update: 17 hours ago
See Project
14

Lightpanda Browser

Lightpanda: the headless browser designed for AI and automation

...This design allows it to execute JavaScript and interact with web pages while avoiding the overhead associated with rendering images, fonts, and layout elements intended for visual display. The browser is implemented using the Zig programming language and integrates the V8 JavaScript engine to run modern web applications and scripts efficiently. Because it avoids graphical rendering and other heavy browser components, the system uses significantly less memory and launches almost instantly compared to conventional browsers such as Chrome.

Downloads: 30 This Week

Last Update: 2026-05-13
See Project
15

Agentspan

Durable, Distributed runtime for ALL of your agents

...The system is built for durability, meaning tasks can pause for extended periods, including waiting for human approval, and then resume seamlessly. It supports scaling across multiple environments, making it suitable for production-grade agent orchestration. Agentspan includes a local server and visual interface that allow developers to inspect execution flows and debug agent behavior. It also integrates with multiple model providers, enabling flexibility in selecting underlying AI systems. Overall, it provides infrastructure for building resilient, long-running AI agents rather than short-lived scripts.

Downloads: 1 This Week

Last Update: 2026-05-08
See Project
16

Langflow

Low-code app builder for RAG and multi-agent AI applications

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Downloads: 12 This Week

Last Update: 5 days ago
See Project
17

Wan2.1

Wan2.1: Open and Advanced Large-Scale Video Generative Model

Wan2.1 is a foundational open-source large-scale video generative model developed by the Wan team, providing high-quality video generation from text and images. It employs advanced diffusion-based architectures to produce coherent, temporally consistent videos with realistic motion and visual fidelity. Wan2.1 focuses on efficient video synthesis while maintaining rich semantic and aesthetic detail, enabling applications in content creation, entertainment, and research. The model supports text-to-video and image-to-video generation tasks with flexible resolution options suitable for various GPU hardware configurations. ...

1 Review

Downloads: 39 This Week

Last Update: 2026-03-05
See Project
18

LangGraph Studio

Desktop app for prototyping and debugging LangGraph applications

LangGraph Studio offers a new way to develop LLM applications by providing a specialized agent IDE that enables visualization, interaction, and debugging of complex agentic applications. With visual graphs and the ability to edit state, you can better understand agent workflows and iterate faster. LangGraph Studio integrates with LangSmith so you can collaborate with teammates to debug failure modes. While in Beta, LangGraph Studio is available for free to all LangSmith users on any plan tier. LangGraph Studio requires docker-compose version 2.22.0+ or higher. ...

Downloads: 25 This Week

Last Update: 2025-03-06
See Project
19

CodeMirror MCP

CodeMirror extension to hook up a Model Context Provider (MCP)

The codemirror-mcp project is a CodeMirror extension that integrates the Model Context Protocol (MCP) into the CodeMirror editor. This extension enhances the editor's capabilities by providing features such as autocompletion for resource mentions and prompt commands, as well as visual styling for these elements. It aims to streamline the user experience when working with MCP within the CodeMirror environment.

Downloads: 0 This Week

Last Update: 2025-08-10
See Project
20

dtreeviz

Python library for decision tree visualization & model interpretation

...Visualizing decision trees is a tremendous aid when learning how these models work and when interpreting models. The visualizations are inspired by an educational animation by R2D3; A visual introduction to machine learning. Please see How to visualize decision trees for deeper discussion of our decision tree visualization library and the visual design decisions we made.

Downloads: 0 This Week

Last Update: 2025-12-27
See Project
21

Screenshot to Code

A neural network that transforms a design mock-up into static websites

Screenshot-to-code is a tool or prototype that attempts to convert UI screenshots (e.g., of mobile or web UIs) into code representations, likely generating layouts, HTML, CSS, or markup from image inputs. It is part of a research/proof-of-concept domain in UI automation and image-to-UI code generation. Mapping visual design to code constructs. Code/UI layout (HTML, CSS, or markup). Examples/demo scripts showing “image UI code”.

Downloads: 0 This Week

Last Update: 2025-09-26
See Project
22

ManiSkill

SAPIEN Manipulation Skill Framework

...Developed by Hao Su Lab, it focuses on robotic manipulation with diverse, high-quality 3D tasks designed to challenge perception, control, and planning in robotics. ManiSkill provides both low-level control and visual observation spaces for realistic learning scenarios.

Downloads: 0 This Week

Last Update: 2026-04-21
See Project
23

ViZDoom

Doom-based AI research platform for reinforcement learning

ViZDoom allows developing AI bots that play Doom using only the visual information (the screen buffer). It is primarily intended for research in machine visual learning, and deep reinforcement learning, in particular. ViZDoom is based on ZDOOM, the most popular modern source-port of DOOM. This means compatibility with a huge range of tools and resources that can be used to create custom scenarios, availability of detailed documentation of the engine and tools and support of Doom community. ...

Downloads: 0 This Week

Last Update: 2026-02-11
See Project
24

Qwen3.5

Qwen3.5 is the large language model series developed by Qwen team

...Qwen3.5 builds on earlier Qwen generations by improving multilingual understanding, reasoning ability, and efficiency, while also introducing native multimodal capabilities that allow the model to work with both language and visual inputs. Architecturally, the system leverages modern large-scale training techniques and mixture-of-experts style efficiency so that very large parameter counts can be used while keeping inference practical.

Downloads: 15 This Week

Last Update: 2026-04-22
See Project
25

LTX-2

Python inference and LoRA trainer package for the LTX-2 audio–video

LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch. Beyond basic rendering scaffolding, LTX-2 includes optimized math libraries, resource loaders, utilities for texture and buffer handling, and integration points for native event loops and input systems. ...

Downloads: 22 This Week

Last Update: 2026-05-11
See Project