visual-mingw free download

Showing 376 open source projects for "visual-mingw"

View related business solutions

Artificial Intelligence Mac Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.

Try for Free
1

1D Visual Tokenization and Generation

This repo contains the code for 1D tokenizer and generator

The 1D Visual Tokenization and Generation project from ByteDance introduces a novel “one-dimensional” tokenizer designed for images: instead of representing images with large grids of 2D tokens (as in many prior generative/image-modeling systems), it compresses images into as few as 32 discrete tokens (or more, optionally) — thereby achieving a very compact, efficient representation that drastically speeds up generation and reconstruction while retaining strong fidelity.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
2

AWS Toolkit for Visual Studio Code

Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.

The AWS Toolkit extension for Visual Studio Code enables you to interact with Amazon Web Services (AWS). Try the AWS Code Sample Catalog to start coding with the AWS SDK. The AWS Explorer provides access to the AWS services that you can work with when using the Toolkit. To see the AWS Explorer, choose the AWS icon in the Activity bar. The Developer Tools panel is a section for developer-focused tooling curated for working in an IDE.

Downloads: 3 This Week

Last Update: 5 days ago
See Project
3

DVC Extension for Visual Studio Code

https://github.com/iterative/vscode-dvc

A Visual Studio Code extension that integrates Data Version Control (DVC) into the development environment, enhancing reproducibility and collaboration for machine learning projects.

Downloads: 1 This Week

Last Update: 2026-03-02
See Project
4

GoogleTest

Google Testing and Mocking Framework

...GoogleTest features an xUnit test framework, a rich set of assertions, user-defined assertions, death tests, among many others. It's been used on a variety of platforms, including Cygwin, Symbian, MinGW and PlatformIO.

Downloads: 13 This Week

Last Update: 2025-04-30
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

FaceFusion

Industry leading face manipulation platform

FaceFusion is an open-source face swapping and facial enhancement toolkit designed for high-quality video and image manipulation workflows. The project enables users to replace faces in images or videos while maintaining temporal consistency and visual realism. It integrates modern deep learning models for face detection, alignment, and blending to produce smoother results than traditional approaches. FaceFusion is built with a modular pipeline that allows users to customize processing steps and optimize performance for different hardware environments. The tool is often used in content creation, visual effects experimentation, and research into generative media. ...

Downloads: 313 This Week

Last Update: 2026-04-19
See Project
6

UI-TARS Desktop

A GUI Agent app based on UI-TARS to control your computer using AI

UI-TARS Desktop is a graphical user interface (GUI) agent application that leverages the UI-TARS vision-language model to enable natural language control of computers. This cross-platform tool supports both Windows and macOS, allowing users to perform tasks through intuitive commands. Key features include screenshot-based visual recognition, precise mouse and keyboard control, and real-time feedback on actions. Provides immediate responses and visual feedback on actions performed. The application facilitates seamless interaction with the computer, enhancing user experience by simplifying complex operations into straightforward language instructions. Leverages advanced AI to bridge the gap between visual elements and language commands. ...

1 Review

Downloads: 83 This Week

Last Update: 2025-11-04
See Project
7

Pixelle-Video

AI Fully Automated Short Video Engine

...The system emphasizes modularity, allowing developers to plug in different models or processing steps depending on the use case. It can be used for tasks such as content generation, video editing, or visual storytelling. Overall, Pixelle-Video provides a flexible environment for building AI-powered video generation and processing workflows.

Downloads: 81 This Week

Last Update: 2026-04-22
See Project
8

Next AI Draw.io

A next.js web application that integrates AI capabilities with draw.io

Next AI Draw.io is an AI-enhanced diagramming application that integrates generative intelligence into the familiar draw.io-style visual workflow. The project aims to help users create diagrams, flowcharts, and structured visual content more efficiently by leveraging AI-assisted generation and editing capabilities. It combines modern web technologies with diagram automation features to reduce the manual effort typically required in visual design tools. The system is intended for developers, product teams, and technical planners who need rapid diagram creation within collaborative environments. ...

Downloads: 7 This Week

Last Update: 2026-05-21
See Project
9

DeepSeek-OCR 2

Visual Causal Flow

DeepSeek-OCR-2 is the second-generation optical character recognition system developed to improve document understanding by introducing a “visual causal flow” mechanism, enabling the encoder to reorder visual tokens in a way that better reflects semantic structure rather than strict raster scan order. It is designed to handle complex layouts and noisy documents by giving the model causal reasoning capabilities that mimic human visual scanning behavior, enhancing OCR performance on documents with rich spatial structure. ...

Downloads: 10 This Week

Last Update: 2026-02-03
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
10

n8n

Free and source-available fair-code licensed workflow automation tool

n8n is an extendable workflow automation tool. With a fair-code distribution model, n8n will always have visible source code, be available to self-host, and allow you to add your own custom functions, logic and apps. n8n's node-based approach makes it highly versatile, enabling you to connect anything to everything. n8n has 200+ different nodes to automate workflows.

2 Reviews

Downloads: 835 This Week

Last Update: 1 day ago
See Project
11

Ian Xiaohei Illustrations

Chinese Little Black Weird Text Illustration Generation Skill

Ian Xiaohei Illustrations is a Codex Skill for generating distinctive hand-drawn illustrations for Chinese articles, posts, blogs, Notion documents, and methodology content. It is centered on the “Xiaohei” visual character and turns abstract ideas, judgments, metaphors, and workflows into clear editorial images. The skill is not a generic illustration prompt pack, because it gives AI agents a specific visual system to follow. Its default style uses a 16:9 white-background composition with rough hand-drawn lines and small red, orange, or blue annotation marks. ...

Downloads: 11 This Week

Last Update: 2026-06-01
See Project
12

Dify

One API for plugins and datasets, one interface for prompt engineering

Dify is an easy-to-use LLMOps platform designed to empower more people to create sustainable, AI-native applications. With visual orchestration for various application types, Dify offers out-of-the-box, ready-to-use applications that can also serve as Backend-as-a-Service APIs. Unify your development process with one API for plugins and datasets integration, and streamline your operations using a single interface for prompt engineering, visual analytics, and continuous improvement. ...

Downloads: 28 This Week

Last Update: 2026-05-19
See Project
13

ViMax

Director, Screenwriter, Producer, and Video Generator All-in-One

ViMax is an open-source framework for performing large-scale multi-modal vision-language modeling and reasoning by combining powerful image encoders with advanced language models to solve complex visual tasks. It integrates components like visual encoders, cross-modal fusion techniques, and reasoning modules so that users can go beyond simple captioning or classification to perform tasks such as visual question answering, multi-image inference, and structured scene understanding. ViMax’s design accommodates large image sets and supports retrieval augmentation, enabling it to work with external image databases, supplementary metadata, and semantic search to enhance context awareness. ...

Downloads: 8 This Week

Last Update: 2026-06-08
See Project
14

DESIGN.md

A format specification for describing a visual identity

design.md is an open specification created by Google Labs that defines a standardized way to describe design systems for AI coding agents. It allows developers to encode visual identity elements such as colors, typography, spacing, and components in a structured format. The file combines machine-readable design tokens with human-readable explanations, enabling agents to generate consistent user interfaces aligned with a brand. By providing persistent design context, it eliminates the need to repeatedly describe styling requirements to AI tools. ...

Downloads: 16 This Week

Last Update: 22 hours ago
See Project
15

Ideogram 4

Open image model at the forefront of design

Ideogram 4 is an open-weight text-to-image model focused on high-quality visual generation, design control, and accurate text rendering inside images. It is built for users who need more than generic image generation, especially when layout, typography, composition, color, and language understanding matter. The project introduces a structured JSON prompting workflow that gives creators more explicit control over scene details and visual constraints.

Downloads: 19 This Week

Last Update: 2026-06-05
See Project
16

Graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)

...The architecture emphasizes flexibility, enabling users to customize how data is mapped and displayed. It may also include analytical features to explore patterns, clusters, or anomalies within the graph. Overall, Graphify serves as a bridge between raw data and visual insight.

Downloads: 13 This Week

Last Update: 17 hours ago
See Project
17

Coze Studio

An AI agent development platform with all-in-one visual tools

Coze Studio is ByteDance’s open‑source, visual AI agent development platform. It offers no-code/low-code workflows to build, debug, and deploy conversational agents, integrating prompting, RAG-based knowledge bases, plugin systems, and workflow orchestration. Developed in Go (backend) and React/TypeScript (frontend), it uses a containerized microservices architecture suitable for enterprise deployment.

Downloads: 7 This Week

Last Update: 2026-01-20
See Project
18

OpenClaw Office

OpenClaw Office is the visual monitoring and management frontend

...Users can observe communication flows between agents through visual connections, track token usage and operational costs, and analyze performance through integrated dashboards and charts. The system also includes live chat capabilities, allowing users to monitor conversations and tool calls as they occur.

Downloads: 5 This Week

Last Update: 2026-05-10
See Project
19

InternGPT

Open source demo platform where you can easily showcase your AI models

InternGPT is an open-source multimodal AI framework designed to extend large language models beyond text interactions into visual reasoning and image manipulation tasks. The system integrates conversational AI with computer vision models so users can interact with images, videos, and visual environments through natural language instructions. Unlike traditional chat systems that rely solely on text prompts, InternGPT allows users to interact with visual content using both language and nonverbal signals such as pointing or highlighting objects within images. ...

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
20

InternLM-XComposer-2.5

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System

...It incorporates visual understanding modules that allow the model to analyze images and integrate them into coherent narrative outputs. The framework also supports tasks such as image captioning, multimodal reasoning, and layout generation for structured visual documents. By combining language generation with visual composition capabilities, the system enables new forms of content creation that integrate written explanations with automatically generated visual components.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
21

Kimi K2.5

Moonshot's most powerful AI model

Kimi K2.5 is Moonshot AI’s open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed vision and text tokens. Based on a 1T-parameter Mixture-of-Experts (MoE) architecture with 32B activated parameters, it integrates advanced language reasoning with strong visual understanding. K2.5 supports both “Thinking” and “Instant” modes, enabling either deep step-by-step reasoning or low-latency responses depending on the task. Designed for agentic workflows, it features an Agent Swarm mechanism that decomposes complex problems into coordinated sub-agents executing in parallel. With a 256K context length and MoonViT vision encoder, the model excels across reasoning, coding, long-context comprehension, image, and video benchmarks. ...

Downloads: 22 This Week

Last Update: 2026-05-28
See Project
22

Playwright MCP

Playwright MCP server

An MCP server developed by Microsoft that offers browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots without relying on visual data.

Downloads: 6 This Week

Last Update: 7 days ago
See Project
23

WeChatMsg

Project aimed at extracting, exporting, and analyzing chat records

...It provides tools that read local WeChat database files and allow users to convert chat data into readable formats such as HTML, Word, and CSV, making it possible to inspect conversations outside the mobile app environment. Beyond simple export, the project includes mechanisms for analyzing chat histories and generating annual reports or visual summaries about messaging trends, interaction patterns, and more. The original README communicates a guiding philosophy about owning personal data and using it responsibly to train personalized AI agents or preserve memories. Although the repository has seen periods of inactivity and may not receive frequent updates, its widespread use indicates community interest in preserving chat logs and understanding conversation data outside of the WeChat interface.

Downloads: 285 This Week

Last Update: 2026-02-06
See Project
24

ComfyUI-LTXVideo

LTX-Video Support for ComfyUI

ComfyUI-LTXVideo is a bridge between ComfyUI’s node-based generative workflow environment and the LTX-Video multimedia processing framework, enabling creators to orchestrate complex video tasks within a visual graph paradigm. Instead of writing code to apply effects, transitions, edits, and data flows, users can assemble nodes that represent video inputs, transformations, and outputs, letting them prototype and automate video production pipelines visually. This integration empowers non-programmers and rapid-iteration teams to harness the performance of LTX-Video while maintaining the clarity and flexibility of a dataflow graph model. ...

Downloads: 7 This Week

Last Update: 2026-05-11
See Project
25

PySpur

Visual tool for building, testing, and deploying AI agent workflows

...By offering a visual representation of workflows, PySpur makes it easier to debug interactions between components and identify failures in complex pipelines. It supports iterative experimentation, allowing developers to rapidly improve agents without rebuilding systems from scratch. PySpur also enables deployment of finalized workflows after testing, making it suitable for both development and production use.

Downloads: 2 This Week

Last Update: 2026-03-17
See Project