Page 8 | visual\ free download

Showing 487 open source projects for "visual\"

View related business solutions

Python Clear Filters & Widen Search

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
Compliant and Reliable File Transfers Backed by Top Security Certifications
Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.

Start Free Trial
1

Oasis

Inference script for Oasis 500M

Open-Oasis provides inference code and released weights for Oasis 500M, an interactive world model that generates gameplay frames conditioned on user keyboard input. Instead of rendering a pre-built game world, the system produces the next visual state via a diffusion-transformer approach, effectively “imagining” the world response to your actions in real time. The project focuses on enabling action-conditional frame generation so developers can experiment with interactive, model-generated environments rather than static video generation alone. Because it’s an inference-focused repository, it’s especially useful as a practical reference for running the model, wiring inputs, and producing the autoregressive sequence of gameplay frames. ...

Downloads: 0 This Week

Last Update: 2026-01-06
See Project
2

Multimodal

TorchMultimodal is a PyTorch library

...The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. The repository also includes example scripts and datasets for common multimodal tasks (e.g. retrieval, visual question answering, grounding) so you can test and compare models end to end. Installation supports both CPU and CUDA, and the codebase is versioned, tested, and maintained.

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
3

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution

...It includes utilities to fine-tune vision-language embeddings, compute prompt or adapter updates, and benchmark across transfer and retention metrics. MetaCLIP is especially suited for real-world settings where a model must continuously incorporate new visual categories or domains over time.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
4

VGGT

[CVPR 2025 Best Paper Award] VGGT

VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. ...

Downloads: 0 This Week

Last Update: 2025-10-11
See Project
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
5

LLaMA-Mesh

Unifying 3D Mesh Generation with Language Models

...By serializing 3D geometry into text tokens, the approach allows existing transformer architectures to generate and interpret 3D models without requiring specialized visual tokenizers. The project includes a supervised fine-tuning dataset composed of interleaved text and mesh data, allowing the model to learn relationships between textual descriptions and 3D structures. As a result, the model can generate mesh models directly from text prompts, explain mesh structures in natural language, or output mixed text-and-mesh sequences. ...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
6

AI-Codereview-Gitlab

GitLab automatic code review tool based on large models

AI-Codereview-Gitlab is an open-source automation tool that integrates large language models into the GitLab development workflow to perform automated code reviews. The system monitors GitLab repositories and analyzes commits or merge requests using AI models to identify potential issues, coding mistakes, and quality improvements before the code is merged. By leveraging multiple large language model providers—including OpenAI, DeepSeek, ZhipuAI, or local models through Ollama—the platform...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
7

InfiniteYou

Flexible Photo Recrafting While Preserving Your Identity

...Using an architecture built around diffusion transformers (DiTs), InfiniteYou introduces a component called InfuseNet that injects identity features derived from reference images into the generation process — via residual connections — so that the output matches the person’s identity closely, without sacrificing visual quality or text-image alignment. The team uses a multi-stage training strategy with synthetic multi-sample data per identity to fine-tune for both identity consistency and aesthetic quality. Compared to prior methods, InfiniteYou significantly improves on identity similarity, text-prompt adherence, overall image quality, and avoids common problems such as face copy-pasting artifacts.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
8

shot-scraper

A command-line utility for taking automated screenshots of websites

shot-scraper is a command-line utility for taking automated screenshots of web pages using a headless browser engine. After installation, a single command can capture a full-page screenshot of a URL and save it to a file, making it ideal for documentation, monitoring, and visual regression tasks. Under the hood it uses a modern browser (installed via a one-time shot-scraper install step) and exposes options for viewport size, full-page versus clipped screenshots, and device emulation. Beyond simple captures, it can run custom JavaScript before taking the shot, allowing you to open menus, scroll, or manipulate the DOM so the screenshot reflects the desired state. ...

Downloads: 0 This Week

Last Update: 2026-02-01
See Project
9

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 0 This Week

Last Update: 2026-04-08
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

MiniMax-01

Large-language-model & vision-language-model based on Linear Attention

MiniMax-01 is the official repository for two flagship models: MiniMax-Text-01, a long-context language model, and MiniMax-VL-01, a vision-language model built on top of it. MiniMax-Text-01 uses a hybrid attention architecture that blends Lightning Attention, standard softmax attention, and Mixture-of-Experts (MoE) routing to achieve both high throughput and long-context reasoning. It has 456 billion total parameters with 45.9 billion activated per token and is trained with advanced parallel...

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
11

Qwen-VL

Chat & pretrained large vision language model

Qwen-VL is Alibaba Cloud’s vision-language large model family, designed to integrate visual and linguistic modalities. It accepts image inputs (with optional bounding boxes) and text, and produces text (and sometimes bounding boxes) as output. The model variants (VL-Plus, VL-Max, etc.) have been upgraded for better visual reasoning, text recognition from images, fine-grained understanding, and support for high image resolutions / extreme aspect ratios.

Downloads: 2 This Week

Last Update: 2025-09-23
See Project
12

airda

airda(Air Data Agent

airda(Air Data Agent) is a multi-smart body for data analysis, capable of understanding data development and data analysis needs, understanding data, generating data-oriented queries, data visualization, machine learning and other tasks of SQL and Python codes.

Downloads: 1 This Week

Last Update: 2024-09-03
See Project
13

Luna AI

Virtual AI anchor that combines state-of-the-art technology

Luna AI is a virtual AI streamer framework designed to power an interactive VTuber that can go live on major platforms and chat with viewers in real time. It is built around a core assistant persona called “Luna AI,” which can be driven by a wide range of large language models and platforms, including GPT-style APIs, Claude, LangChain-based backends, ChatGLM, Kimi, Ollama, and many others. The project supports multiple rendering backends for the avatar, such as Live2D, Unreal Engine (UE),...

Downloads: 14 This Week

Last Update: 2025-11-28
See Project
14

DWSIM - Open Source Process Simulator

Simulate chemical processes using advanced thermodynamic models

DWSIM is an open source, CAPE-OPEN compliant chemical process simulator for Windows, Linux and macOS systems. Written in VB.NET and C#, DWSIM features a comprehensive set of unit operations, advanced thermodynamic models, support for reacting systems, petroleum characterization tools and a fully-featured graphical interface. DWSIM Pro is a commercial sibling of DWSIM built on top of open-source software. It offers extended features, comes with private support, and is accessible in the...

Downloads: 2,347 This Week

Last Update: 2025-10-28
See Project
15

SCons

A software construction tool

SCons is a software construction tool that is a superior alternative to the classic "Make" build tool that we all know and love. SCons is implemented as a Python script and set of modules, and SCons "configuration files" are actually executed as Python scripts. This gives SCons many powerful capabilities not found in other software build tools. We make SCons available in three distinct packages, for different purposes. - The scons package is the basic package to install SCons. You...

28 Reviews

Downloads: 2,425 This Week

Last Update: 2025-11-16
See Project
16

NebiOS Alternative Mirror (Official)

A unique Ubuntu-based Linux OS with its own desktop and ecosystem.

NebiOS is an Ubuntu LTS-based Linux distribution with a custom desktop environment (NebiDE), built by one person. It combines a unique visual identity (Flat-Skeuomorphism), a widget-first desktop, and its own app format (.napp) into a cohesive platform — not just another reskin. Official mirror — use if the main site has download issues. Main site: https://nebios.org 📌 This mirror page is updated less frequently than the main site. For the latest information, visit https://nebios.org ⚠️ Don't download from unknown sources...

1 Review

Downloads: 2,014 This Week

Last Update: 2026-05-04
See Project
17

PySchool

Installable / Portable Python Distribution for Everyone.

PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...

1 Review

Downloads: 762 This Week

Last Update: 2026-04-01
See Project
18

OpenPiano — Virtual Piano for Windows

Desktop piano playable with a PC keyboard, mouse, or MIDI device.

OpenPiano is a Windows desktop piano application that allows you to play, practice, and record music using your PC keyboard, mouse, or a MIDI device. It supports real-time playback using SoundFonts and provides on-screen piano layouts for visual feedback while playing. OpenPiano is designed to run entirely locally. It does not require accounts, cloud services, or an internet connection for core functionality. Project links: Website: https://www.justagwas.com/projects/openpiano GitHub: https://github.com/Justagwas/openpiano Documentation: https://github.com/Justagwas/openpiano/wiki The application is fully open source. ...

Downloads: 195 This Week

Last Update: 2026-02-27
See Project
19

LazyDS4

Transform your DualShock 4 into a native Xbox 360 controller. v2.1.0

...Optimized for simplicity and performance, it offers real-time input mapping with a ~1ms (1000Hz) polling rate, modern graphical interface with PyQt5, and automatic joystick calibration. 📌 What's New (v2.1.0) 🎨 Visual Overhaul — Modern, professional interface with gradient backgrounds and advanced styling 📐 Layout Optimization — Improved spacing, alignment, and responsiveness 🎉 Icon Integration — Consistent use of dynamic SVG icons throughout ⚡ Enhanced Button Design — Polished buttons with smooth animations and clear actions 📊 Responsive Interface — Adapted for various screen sizes without overlaps 🔔 Real-time Visual Indicators — Clear and immediate feedback for status, warnings, and battery levels Built by the lazy, for the lazy.

Downloads: 16 This Week

Last Update: 2025-10-20
See Project
20

OculiX

Visual Automation IDE — automate anything you see on screen

...Key features: - Guided step-by-step recorder with live code preview - Image recognition via OpenCV 4.10 - Dual OCR: Tesseract (built-in) + PaddleOCR (neural, high precision) - Local and remote automation via integrated VNC - SSH tunnels via embedded JSch - Cross-platform: Windows, macOS (Apple Silicon M1-M4), Linux - Scripting: Jython, JRuby, Java, PowerShell, AppleScript - Java 17 recommended (Java 8+ supported) - Full CI/CD with automated builds for all platforms Used worldwide for test automation, RPA, and visual regression testing. MIT License. Maintained by oculix-org.

Downloads: 93 This Week

Last Update: 4 days ago
See Project
21

HuMidi

Play MIDI like a human in ROBLOX with automatic sustain pedals.

...The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance that no other MIDI players could! Personally have been tested in: - Visual Pianos - Starving Pianists - Digital Piano - Piano's Got Talent This tool is universal, as long as the piano in-game allows to be played on a keyboard.

1 Review

Downloads: 423 This Week

Last Update: 2026-03-18
See Project
22

Universal runtime installer

This installer allows to install the latest Windows Runtimes

This installer allows to install the latest Visual C++ Runtime of all years (2008-2022), Full DirectX Runtime, Microsoft XNA Framework, .Net Runtime, Java and OpenAL at once.

Downloads: 36 This Week

Last Update: 5 days ago
See Project
23

Caesium - Image Compressor

...THIS PROJECT HAS BEEN MOVED!! https://github.com/Lymphatus/caesium-image-compressor Caesium reduces the size of your picture up to 90%, preserving the original visual quality. Allows you to save a lot of space and easily upload your pictures on the web in a moment. The software is user-friendly with a simple and clear interface.

19 Reviews

Downloads: 73 This Week

Last Update: 2025-08-11
See Project
24

Gammu

Cellular manager for mobile phones/modems

Gammu is a cellular manager for mobile phones/modems. It contains libraries and functions for ringtones,logos,phonebook,SMS,etc. (used by external software), a command line version (with backup/restore) and SMS gateway (with MySQL and PostgreSQL supp

24 Reviews

Downloads: 26 This Week

Last Update: 2026-02-09
See Project
25

FreeImage

FreeImage is a library project for developers who would like to support popular graphics image formats (PNG, JPEG, TIFF, BMP and others). Some highlights are: extremely simple in use, not limited to the local PC (unique FreeImageIO) and Plugin driven!

36 Reviews

Downloads: 2,020 This Week

Last Update: 2024-04-17
See Project