Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Computer Vision Libraries
Search Results

Search Results for "visual"

x

Sort By:

Relevance

Clear All Filters

OS

Mac 17
Linux 17
Windows 17
More...
BSD 7
ChromeOS 7

Category

Artificial Intelligence 17
Software Development 2

License

OSI-Approved Open Source 11
Creative Commons Attribution License 1

Programming Language

Python 9
C++ 1
Go 1
JavaScript 1
More...
MATLAB 1

Showing 17 open source projects for "visual"

View related business solutions

Computer Vision Libraries Mac Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Context for your AI agents
Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.

Try for free
1

R1-V

Witness the aha moment of VLM with less than $3

R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.

Downloads: 0 This Week

Last Update: 2025-03-19
See Project
2

LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant

Visual instruction tuning towards large language and vision models with GPT-4 level capabilities.

Downloads: 1 This Week

Last Update: 2024-02-04
See Project
3

Screenshot to Code

A neural network that transforms a design mock-up into static websites

Screenshot-to-code is a tool or prototype that attempts to convert UI screenshots (e.g., of mobile or web UIs) into code representations, likely generating layouts, HTML, CSS, or markup from image inputs. It is part of a research/proof-of-concept domain in UI automation and image-to-UI code generation. Mapping visual design to code constructs. Code/UI layout (HTML, CSS, or markup). Examples/demo scripts showing “image UI code”.

Downloads: 2 This Week

Last Update: 2025-09-26
See Project
4

Phi-3-MLX

Phi-3.5 for Mac: Locally-run Vision and Language Models

Phi-3-Vision-MLX is an Apple MLX (machine learning on Apple silicon) implementation of Phi-3 Vision, a lightweight multi-modal model designed for vision and language tasks. It focuses on running vision-language AI efficiently on Apple hardware like M1 and M2 chips.

Downloads: 1 This Week

Last Update: 2025-03-13
See Project
Desktop and Mobile Device Management Software
It's a modern take on desktop management that can be scaled as per organizational needs.

Desktop Central is a unified endpoint management (UEM) solution that helps in managing servers, laptops, desktops, smartphones, and tablets from a central location.

Learn More
5

VGGT

[CVPR 2025 Best Paper Award] VGGT

VGGT is a transformer-based framework aimed at unifying classic visual geometry tasks—such as depth estimation, camera pose recovery, point tracking, and correspondence—under a single model. Rather than training separate networks per task, it shares an encoder and leverages geometric heads/decoders to infer structure and motion from images or short clips. The design emphasizes consistent geometric reasoning: outputs from one head (e.g., correspondences or tracks) reinforce others (e.g., pose or depth), making the system more robust to challenging viewpoints and textures. ...

Downloads: 1 This Week

Last Update: 2025-10-11
See Project
6

GoCV

Go package for computer vision using OpenCV 4 and beyond

...Our mission is to make the Go language a “first-class” client compatible with the latest developments in the OpenCV ecosystem. Computer Vision (CV) is the ability of computers to process visual information, and perform tasks normally associated with those performed by humans. CV software typically processes video images, then uses the data to extract information in order to do something useful. Since memory allocations for images in GoCV are done through C based code, the go garbage collector will not clean all resources associated with a Mat. ...

Downloads: 1 This Week

Last Update: 2026-01-05
See Project
7

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution

...It includes utilities to fine-tune vision-language embeddings, compute prompt or adapter updates, and benchmark across transfer and retention metrics. MetaCLIP is especially suited for real-world settings where a model must continuously incorporate new visual categories or domains over time.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
8

ConvNeXt

Code release for ConvNeXt model

...It revisits classic ResNet-style backbones through the lens of transformer design trends—large kernel sizes, inverted bottlenecks, layer normalization, and GELU activations—to bridge the performance gap between convolutions and attention-based models. ConvNeXt’s clean, hierarchical structure makes it efficient for both pretraining and fine-tuning across a wide range of visual recognition tasks. It achieves competitive or superior results on ImageNet and downstream datasets while being easier to deploy and train than transformers. The repository provides pretrained models, training recipes, and ablation studies demonstrating how incremental design choices collectively yield state-of-the-art performance.

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
9

DensePose

A real-time approach for mapping all human pixels of 2D RGB images

...The repository includes the DensePose network architecture, training code, pretrained models, and dataset tools for annotation and visualization. DensePose is widely used in augmented reality, motion capture, virtual try-on, and visual effects applications because it enables real-time 3D human mapping from 2D inputs. The model architecture builds on Mask R-CNN, using additional regression heads to predict UV coordinates that map image pixels to 3D surfaces.

Downloads: 5 This Week

Last Update: 2025-10-06
See Project
Taking the Paper Out of Work
For organizations that need powerful ECM and document automation software

The Square 9 AI-powered intelligent document processing platform takes the paper out of work and makes it easier to get things done with digital workflows.

Learn More
10

PyTorch SimCLR

PyTorch implementation of SimCLR: A Simple Framework

For quite some time now, we know about the benefits of transfer learning in Computer Vision (CV) applications. Nowadays, pre-trained Deep Convolution Neural Networks (DCNNs) are the first go-to pre-solutions to learn a new task. These large models are trained on huge supervised corpora, like the ImageNet. And most important, their features are known to adapt well to new problems. This is particularly interesting when annotated training data is scarce. In situations like this, we take the...

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
11

Butteraugli

Estimates the psychovisual difference between two images

...The metric has become a common yardstick for objective image quality when comparing codecs or encoder tweaks that target web or mobile delivery. Because it is deterministic and fast, it can be used in automated pipelines to gate releases on visual quality, not just file size.

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
12

Show Facebook Computer Vision Tags

Chrome Extension that displays automated image tags from Facebook

...Since Facebook uses a computer-vision model to analyse user-uploaded images and generate alt-text tags for accessibility (e.g., “Image may contain: golf, grass, outdoor and nature”), this extension surfaces those hidden tags directly in the UI—revealing what kind of information Facebook infers about images (objects present, activities being done, environment). The purpose is educational and somewhat cautionary: to help users understand the scope of visual inference and privacy issues. Once installed, the extension overlays those tags on images in the timeline, making visible what is typically hidden metadata. The project is relatively lightweight but has garnered attention due to its privacy transparency angle.

Downloads: 0 This Week

Last Update: 2025-11-14
See Project
13

Portable Robotics Eye Vergence Control

Eye movements control portable on different robotic stereo heads

This project provides a software module for the control of the binocular coordination of a robotic stereo head, based on a bio-inspired algorithm. The project is now available for the iCub platform to work on YARP [https://github.com/stino78/vergence-control/][1] The algorithm works on the top of a distributed representation of binocular disparity supplied by a population of binocular energy-model neural units. The project allows a robust control and adaptive binocular coordination...

1 Review

Downloads: 0 This Week

Last Update: 2018-07-01
See Project
14

ECO

Matlab implementation of the ECO tracker

ECO (Efficient Convolution Operators for Tracking) is a high-performance object tracking algorithm developed by Martin Danelljan and collaborators. It is based on discriminative correlation filters and designed to handle appearance changes, occlusions, and scale variations in visual object tracking tasks. The code provides a MATLAB implementation of the ECO and ECO-HC (high-speed) variants and was one of the top performers on multiple visual tracking benchmarks.

Downloads: 0 This Week

Last Update: 2025-07-24
See Project
15

Awesome Recurrent Neural Networks

A curated list of resources dedicated to RNN

A curated list of resources dedicated to recurrent neural networks (closely related to deep learning). Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (Tensorflow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn...

Downloads: 0 This Week

Last Update: 2021-09-22
See Project
16

Robogathers

A simple opensource 3d network game

Robogathers is a 3d game, developed mainly for academic purposes to teach future software engineers with 3d graphic processing, mathematical modelling, networking and multithreading. In the far future we also want to use this project as a base for scientific research in fields of visual odometry, simultaneous localization and mapping and computer vision. The game consist in driving your robot gathering good prizes and avoiding bad ones. The game is multiplayer and during it you can compete with other people via network as well as with AI robots.

Downloads: 0 This Week

Last Update: 2012-12-28
See Project
17

computer-vision-datasets

Various datasets from my computer vision research primarily useful for lane detection and visual servoing algorithms.

Downloads: 0 This Week

Last Update: 2016-07-25
See Project

Previous
You're on page 1
Next

Related Searches

phi

robot designing software

recurrent neural networks matlab

lane detection

robotics

lane detection and tracking

Related Categories

Artificial Intelligence

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: