Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "data vision" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 52
Windows 47
Mac 44
More...
BSD 20
ChromeOS 19
Mobile Operating Systems 2

Category

Artificial Intelligence 38
Software Development 10
Business 8
Formats and Protocols 3
Multimedia 2
Scientific/Engineering 2
Database 1
Education 1
Games 1
Internet 1
Social sciences 1

License

OSI-Approved Open Source 46
Creative Commons Attribution License 3

Translations

English 1

Programming Language

Python 54
C++ 3
MATLAB 2
JavaScript 1
Rust 1
More...
Unix Shell 1

Status

Production/Stable 3
Alpha 2
Beta 2

Showing 54 open source projects for "data vision"

View related business solutions

Python Clear Filters & Widen Search

Build Secure Enterprise Apps Fast with Retool
Stop wasting engineering hours. Build secure, production-grade apps that connect directly to your company’s SQL and APIs.

Create internal software that meets enterprise security standards. Retool connects to your business data—databases, APIs, and vector stores while ensuring compliance with granular permissions and audit logs. Whether on our cloud or self-hosted, build the dashboards and admin panels your organization needs without compromising on security or control.

Learn More
Find Hidden Risks in Windows Task Scheduler
Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.

Download Free Tool
1

CoreNet

CoreNet: A library for training deep neural networks

CoreNet is Apple’s internal deep learning framework for distributed neural network training, designed for high scalability, low-latency communication, and strong hardware efficiency. It focuses on enabling large-scale model training across clusters of GPUs and accelerators by optimizing data flow and parallelism strategies. CoreNet provides abstractions for data, tensor, and pipeline parallelism, allowing models to scale without code duplication or heavy manual configuration. Its distributed runtime manages synchronization, load balancing, and mixed-precision computation to maximize throughput while minimizing communication bottlenecks. ...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
2

CO3D (Common Objects in 3D)

Tooling for the Common Objects In 3D dataset

CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. ...

Downloads: 3 This Week

Last Update: 4 days ago
See Project
3

DeepSeek-OCR

Contexts Optical Compression

DeepSeek-OCR is an open-source optical character recognition solution built as part of the broader DeepSeek AI vision-language ecosystem. It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body...

Downloads: 15 This Week

Last Update: 2026-01-27
See Project
4

NitroGen

A Foundation Model for Generalist Gaming Agents

NitroGen is a foundation model for generalist gaming agents developed under the MineDojo initiative, aimed at training a vision-action AI that can play and interact with a wide variety of games by taking pixel inputs and predicting gamepad actions. As an open research model, NitroGen is trained on extensive gameplay data spanning thousands of hours and hundreds of games to instill broad, generalizable gaming competency rather than skill at a single title. This approach enables the model to...

Downloads: 2 This Week

Last Update: 4 days ago
See Project
Atera all-in-one platform IT management software with AI agents
Ideal for internal IT departments or managed service providers (MSPs)

Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.

Learn More
5

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks

OSWorld is an open-source synthetic world environment designed for embodied AI research and multi-agent learning. It provides a richly simulated 3D world where multiple agents can interact, perform tasks, and learn complex behaviors. OSWorld emphasizes multi-modal interaction, enabling agents to process visual, auditory, and symbolic data for grounded learning in a simulated world.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
6

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

HunyuanOCR is an open-source, end-to-end OCR (optical character recognition) Vision-Language Model (VLM) developed by Tencent‑Hunyuan. It’s designed to unify the entire OCR pipeline, detection, recognition, layout parsing, information extraction, translation, and even subtitle or structured output generation, into a single model inference instead of a cascade of separate tools. Despite being fairly lightweight (about 1 billion parameters), it delivers state-of-the-art performance across a...

Downloads: 0 This Week

Last Update: 2026-01-13
See Project
7

Android Use

Automate native Android apps with AI using accessibility APIs

android-action-kernel is an open source Python library designed to let AI agents control and automate native Android applications running on real devices or emulators. It fills a gap in automation tooling by focusing on mobile-first workflows where traditional browser or desktop-based automation doesn’t work; such as logistics, gig work, field operations, and other industries reliant on phones or tablets. The project works by using Android’s accessibility API to extract structured UI state...

Downloads: 4 This Week

Last Update: 3 days ago
See Project
8

CogAgent

An open sourced end-to-end VLM-based GUI Agent

CogAgent is a 9B-parameter bilingual vision-language GUI agent model based on GLM-4V-9B, trained with staged data curation, optimization, and strategy upgrades to improve perception, action prediction, and generalization across tasks. It focuses on operating real user interfaces from screenshots plus text, and follows a strict input–output format that returns structured actions, grounded operations, and optional sensitivity annotations.

Downloads: 2 This Week

Last Update: 5 days ago
See Project
9

Uncertainty Baselines

High-quality implementations of standard and SOTA methods

Uncertainty Baselines is a collection of strong, well-documented training pipelines that make it straightforward to evaluate predictive uncertainty in modern machine learning models. Rather than offering toy scripts, it provides end-to-end recipes—data input, model architectures, training loops, evaluation metrics, and logging—so results are comparable across runs and research groups. The library spans canonical modalities and tasks, from image classification and NLP to tabular problems,...

Downloads: 0 This Week

Last Update: 2026-01-14
See Project
Grafana: The open and composable observability platform
Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

Grafana is the open source analytics & monitoring solution for every database.

Learn More
10

NVIDIA NeMo Framework

Scalable generative AI framework built for researchers and developers

...The framework builds on PyTorch Lightning–style modular abstractions, so training scripts are composed from reusable components for data loading, models, optimizers, and schedulers, which simplifies experimentation and adaptation. NeMo is designed to scale: with tools like NeMo-Run, users can orchestrate large-scale experiments across thousands of GPUs.

Downloads: 0 This Week

Last Update: 2026-01-09
See Project
11

iJEPA

Official codebase for I-JEPA

...This objective sidesteps generative pixel losses and avoids heavy negative sampling, producing features that transfer strongly with linear probes and minimal fine-tuning. The design scales naturally with Vision Transformer backbones and flexible masking strategies, and it trains stably at large batch sizes. i-JEPA’s predictions are made in embedding space, which is computationally efficient and better aligned with downstream discrimination tasks. The repository provides training recipes, data pipelines, and evaluation code that clarify which masking patterns and architectural choices matter most.

Downloads: 2 This Week

Last Update: 2025-10-07
See Project
12

PIFuHD

High-Resolution 3D Human Digitization from A Single Image

PIFuHD (Pixel-Aligned Implicit Function for 3D human reconstruction at high resolution) is a method and codebase to reconstruct high-fidelity 3D human meshes from a single image. It extends prior PIFu work by increasing resolution and detail, enabling fine geometry in cloth folds, hair, and subtle surface features. The method operates by learning an implicit occupancy / surface function conditioned on the image and camera projection; at inference time it queries dense points to reconstruct a...

Downloads: 4 This Week

Last Update: 2025-10-06
See Project
13

FFCV

Fast Forward Computer Vision (and other ML workloads!)

ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.

Downloads: 0 This Week

Last Update: 2024-08-07
See Project
14

MAE (Masked Autoencoders)

PyTorch implementation of MAE

MAE (Masked Autoencoders) is a self-supervised learning framework for visual representation learning using masked image modeling. It trains a Vision Transformer (ViT) by randomly masking a high percentage of image patches (typically 75%) and reconstructing the missing content from the remaining visible patches. This forces the model to learn semantic structure and global context without supervision. The encoder processes only the visible patches, while a lightweight decoder reconstructs the...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
15

gToWbot

Automatically collect resources on your farm

Free and open-source bot writed on python3 for Tales of Wind MMORPG. Using the opencv-python computer vision package! At now: helps you to collect automatically resources (fish, wood, stones) on your farm! Works with all types of translations of ToW gToWbot doesn't collect your personal data and ToW account details! GitHub: https://github.com/grildroid/gToWbot Discord: https://discord.gg/6ZGDgFjDVm

Downloads: 0 This Week

Last Update: 2021-08-17
See Project
16

PyCls

Codebase for Image Classification Research, written in PyTorch

pycls is a focused PyTorch codebase for image classification research that emphasizes reproducibility and strong, transparent baselines. It popularized families like RegNet and supports classic architectures (ResNet, ResNeXt) with clean implementations and consistent training recipes. The repository includes highly tuned schedules, augmentations, and regularization settings that make it straightforward to match reported accuracy without guesswork. Distributed training and mixed precision are...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
17

PyTorch SimCLR

PyTorch implementation of SimCLR: A Simple Framework

For quite some time now, we know about the benefits of transfer learning in Computer Vision (CV) applications. Nowadays, pre-trained Deep Convolution Neural Networks (DCNNs) are the first go-to pre-solutions to learn a new task. These large models are trained on huge supervised corpora, like the ImageNet. And most important, their features are known to adapt well to new problems. This is particularly interesting when annotated training data is scarce.

Downloads: 0 This Week

Last Update: 2022-08-15
See Project
18

CNN for Image Retrieval

...The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that can be adapted for custom datasets, making it useful for experimenting with retrieval systems in computer vision. By leveraging CNN architectures, the project showcases how learned embeddings can capture semantic similarity across varied images. This resource serves as both an educational reference and a foundation for further exploration in image retrieval research.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
19

PyArmadillo

linear algebra library for Python

PyArmadillo - streamlined linear algebra library for Python, with emphasis on ease of use. Alternative to NumPy / SciPy. * Main page: https://pyarma.sourceforge.io * Documentation: https://pyarma.sourceforge.io/docs.html * Bug reports: https://pyarma.sourceforge.io/faq.html * Git repo: https://gitlab.com/jason-rumengan/pyarma

Downloads: 2 This Week

Last Update: 2023-04-19
See Project
20

VideoPose3D

Efficient 3D human pose estimation in video using 2D keypoint

...By using only 2D detections (such as those from OpenPose or Detectron), it enables markerless 3D pose estimation with relatively lightweight computational requirements. The framework includes pretrained models, data preprocessing utilities, visualization tools, and evaluation scripts for standard benchmarks like Human3.6M. VideoPose3D has been used widely in computer vision research for human motion understanding, activity recognition, and animation generation.

Downloads: 2 This Week

Last Update: 2025-10-07
See Project
21

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation

Mask R-CNN Benchmark is a PyTorch-based framework that provides high-performance implementations of object detection, instance segmentation, and keypoint detection models. Originally built to benchmark Mask R-CNN and related models, it offers a clean, modular design to train and evaluate detection systems efficiently on standard datasets like COCO. The framework integrates critical components—region proposal networks (RPNs), RoIAlign layers, mask heads, and backbone architectures such as...

Downloads: 0 This Week

Last Update: 2025-10-06
See Project
22

JSON2YOLO

Convert JSON annotations into YOLO format.

Explore our state-of-the-art AI architecture to train and deploy your highly accurate AI models like a pro. This directory contains label import/export software developed by Ultralytics LLC, and is freely available for redistribution under the GPL-3.0 license. Ultralytics is a U.S.-based particle physics and AI startup with over 6 years of expertise supporting government, academic, and business clients. We offer a wide range of vision AI services, spanning from simple expert advice up to the...

Downloads: 0 This Week

Last Update: 2023-10-26
See Project
23

SFD

S³FD: Single Shot Scale-invariant Face Detector, ICCV, 2017

S³FD (Single Shot Scale-invariant Face Detector) is a real-time face detection framework designed to handle faces of various sizes with high accuracy using a single deep neural network. Developed by Shifeng Zhang, S³FD introduces a scale-compensation anchor matching strategy and enhanced detection architecture that makes it especially effective for detecting small faces—a long-standing challenge in face detection research. The project builds upon the SSD framework in Caffe, with...

Downloads: 2 This Week

Last Update: 3 days ago
See Project
24

survol

RDF-based framework monitoring business systems activity

A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.

Downloads: 0 This Week

Last Update: 2018-05-13
See Project
25

Open Delays

A program that gather trains delays data

A program that gather trains delays data, so as to make user able to make statistics. Delays are published by train companies, but only the current day is available. Open Delays intends to gather this data in the longer term, so users can have a statistical vision on the train they use, and control and identify structural problems in train exploitation. This also allows to know for example if the train you are stepping in had 15mn delay in 30% of the cases : if you have this kind of information, you know you will have probably a delay.

Downloads: 0 This Week

Last Update: 2014-11-12
See Project

Previous
1
You're on page 2
3
Next

Related Searches

ocr

android

windows boot repair

ocr from pdf

urdu ocr software

tesseract-ocr-w64-setup-v5.x.x.exe

tesseract-ocr-w64-setup-v5.3.3.20231005.exe，64

tesseract-ocr-setup-3.02.02.exe

tesseract-ocr

table

Related Categories

Artificial Intelligence

Software Development

Business

Formats and Protocols

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise

×

Thanks for helping keep SourceForge clean.

X

You seem to have CSS turned off. Please don't fill out this field.

You seem to have CSS turned off. Please don't fill out this field.

Briefly describe the problem (required):

Upload screenshot of ad (required):

Select a file, or drag & drop file here.

✔

✘

Screenshot instructions:

Click URL instructions:
Right-click on the ad, choose "Copy Link", then paste here →
(This may not be possible with some types of ads)

More information about our ad policies

Ad destination/click URL: