Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "extraction" - Page 5

x

Sort By:

Relevance

Clear All Filters

OS

Linux 201
Windows 182
Mac 171
More...
BSD 88
ChromeOS 77
Mobile Operating Systems 5
Desktop Operating Systems 4

Category

Artificial Intelligence 108
Scientific/Engineering 32
Software Development 29
Multimedia 25
Internet 21
Business 13
Security 11
System 11
Education 5
Formats and Protocols 5
Text Editors 4
Database 1
Productivity 1

License

OSI-Approved Open Source 194
Other License 5
Creative Commons Attribution License 4
Public Domain 1

Translations

Programming Language

Python 219
C++ 14
C 9
Unix Shell 9
Java 5
More...
MATLAB 5
JavaScript 4
TypeScript 4
Perl 3
R 2
Assembly 1
C# 1
Common Lisp 1
Julia 1
PHP 1
Ruby 1
Scilab 1

Status

Production/Stable 21
Beta 15
Alpha 14
Pre-Alpha 2
More...
Mature 1
Inactive 1

Showing 219 open source projects for "extraction"

View related business solutions

Python Clear Filters & Widen Search

Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.

Get started with Nexo.
1

AudioMuse-AI

AudioMuse-AI is an Open Source Dockerized environment

AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity. By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal...

Downloads: 1 This Week

Last Update: 2026-04-06
See Project
2

OWASP Maryam

Modular OSINT framework for automated open-source intelligence gatheri

...Maryam helps security researchers and analysts streamline routine data-gathering tasks that typically involve searching multiple sources such as Google, Bing, or other online platforms. Maryam organizes its functionality into several modules that focus on different aspects of intelligence gathering, including footprint analysis, OSINT data extraction, and general search operations.

Downloads: 1 This Week

Last Update: 2026-03-08
See Project
3

Perception Models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models

Perception Models is a state-of-the-art framework developed by Facebook Research for advanced image and video perception tasks. It introduces two primary components: the Perception Encoder (PE) for visual feature extraction and the Perception Language Model (PLM) for multimodal decoding and reasoning. The PE module is a family of vision encoders designed to excel in image and video understanding, surpassing models like SigLIP2, InternVideo2, and DINOv2 across multiple benchmarks. Meanwhile, PLM integrates with PE to power vision-language modeling, achieving results competitive with leading multimodal systems such as QwenVL2.5 and InternVL3, all while being fully reproducible with open data. ...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
4

FlexLLMGen

Running large language models on a single GPU

FlexLLMGen is an open-source inference engine designed to run large language models efficiently on limited hardware resources such as a single GPU. The system focuses on high-throughput generation workloads where large batches of text must be processed quickly, such as large-scale data extraction or document analysis tasks. Instead of requiring expensive multi-GPU systems, the framework uses techniques such as memory offloading, compression, and optimized batching to run large models on commodity hardware. The architecture distributes computation and memory usage across the GPU, CPU, and disk in order to maximize the number of tokens processed during inference. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

MiniCPM-o

A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming

MiniCPM-o 2.6 is a cutting-edge multimodal large language model (MLLM) designed for high-performance tasks across vision, speech, and video. Capable of running on end-side devices such as smartphones and tablets, it provides powerful features like real-time speech conversation, video understanding, and multimodal live streaming. With 8 billion parameters, MiniCPM-o 2.6 surpasses its predecessors in versatility and efficiency, making it one of the most robust models available. It supports...

Downloads: 0 This Week

Last Update: 2025-05-15
See Project
6

PokeeResearch-7B

Pokee Deep Research Model Open Source Repo

PokeeResearchOSS provides an open-source, agentic “deep research” model centered on a 7B backbone that can browse, read, and synthesize current information from the web. Instead of relying only on static training data, the agent performs searches, visits pages, and extracts evidence before forming answers to complex queries. It is built to operate end-to-end: planning a research strategy, gathering sources, reasoning over conflicting claims, and writing a grounded response. The repository...

Downloads: 0 This Week

Last Update: 2025-10-27
See Project
7

UCO3D

Uncommon Objects in 3D dataset

...The repository includes automated downloaders with checksum verification, fine-grained controls to fetch only selected modalities or super-categories, and a lightweight Python API for loading frames, geometry, and splats on demand. Metadata is indexed in SQLite for quick queries at scale, and helper builders handle alignment, undistortion, frame extraction from videos, and cropping around the object.

Downloads: 0 This Week

Last Update: 17 hours ago
See Project
8

Patch-NetVLAD

Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition

This repository contains code for the CVPR2021 paper "Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition".

Downloads: 6 This Week

Last Update: 2024-07-11
See Project
9

shuyuan

Reading book source

shuyuan is a project oriented around reading and knowledge consumption, especially targeting large-scale text content such as books, articles, or educational material. The name suggests “academy” or “study hall,” and the tool aims to help users ingest, organize, and manage reading content — possibly offering features like text parsing, annotation, metadata generation, translation, or storage for later reference. The repository is set up to support document ingestion, indexing, and maybe some...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

tika-python

Python binding to the Apache Tika™ REST services

A Python port of the Apache Tika library that makes Tika available using the Tika REST Server. This makes Apache Tika available as a Python library, installable via Setuptools, Pip and easy to install. To use this library, you need to have Java 7+ installed on your system as tika-python starts up the Tika REST server in the background. To get this working in a disconnected environment, download a tika server file (both tika-server.jar and tika-server.jar.md5, which can be found here) and set...

Downloads: 0 This Week

Last Update: 2025-03-22
See Project
11

SecurePose

Automated Face Blurring, Kinematics Extraction and Leg dystonia Dx

...This validation establishes its effectiveness and usability in clinically recorded gait videos for face blurring and kinematics extraction. For installation, https://www.rishabh-bajpai.com/secureposeinstallation Tutorial Videos https://www.youtube.com/playlist?list=PLO4_jCYO5Ib23MoBpn-Wpj1_b6DAYlDwk Please cite the paper: https://arxiv.org/abs/2402.14143

Downloads: 0 This Week

Last Update: 2024-09-13
See Project
12

iX

Autonomous GPT-4 agent platform

IX is a platform for designing and deploying autonomous and [semi]-autonomous LLM-powered agents and workflows. IX provides a flexible and scalable solution for delegating tasks to AI-powered agents. Agents created with the platform can automate a wide variety of tasks while running in parallel and communicating with each other.

Downloads: 6 This Week

Last Update: 2024-09-02
See Project
13

Datapipe

Real-time, incremental ETL library for ML with record-level depend

Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking. Datapipe is designed to streamline the creation of data processing pipelines. It excels in scenarios where data is continuously changing, requiring pipelines to adapt and process only the modified data efficiently. This library tracks dependencies for each record in the pipeline, ensuring minimal and efficient data processing.

3 Reviews

Downloads: 153 This Week

Last Update: 2026-02-07
See Project
14

YoungerSibling

YoungerSibling: Cross-platform OSINT tool for quick data gathering.

YoungerSibling is a Python-based terminal utility script designed for educational purposes. It provides a set of useful tools to perform tasks like searching the web, performing lookups (Google search, IP lookup, username lookup, etc.), and extracting metadata from images, directly from the terminal. This project aims to help students, developers, and hobbyists learn about web scraping, API usage, and terminal interaction with Python.

4 Reviews

Downloads: 10 This Week

Last Update: 2024-11-27
See Project
15

Digital Forensics Guide

Learn all about Digital Forensics and Computer Forensics

The Digital Forensics Guide repository is a comprehensive, structured reference for investigators, analysts, students, and cybersecurity professionals interested in digital forensics principles, tools, methodologies, and workflows. It organizes foundational topics such as evidence acquisition, disk and memory analysis, file system structures, network forensics, artifact extraction, timeline generation, and reporting into digestible modules that help build core competency. Alongside conceptual explanations, the guide includes practical examples with widely used tools (like Autopsy, Volatility, Sleuth Kit, and network analysis suites), illustrating how investigations proceed from initial data capture to final analysis. ...

Downloads: 7 This Week

Last Update: 2025-12-11
See Project
16

LabPlot

Data Visualization and Analysis

LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.

4 Reviews

Downloads: 33 This Week

Last Update: 2025-08-18
See Project
17

FAE: FeAture Explorer

FeAture Explorer (FAE), a radiomics (or medical analysis) tool that helps radiologists extract features, preprocess feature matrix, develop machine learning models (Binary Classification & Survival Analysis) with one-click, and evaluate models qualitatively and quantitatively. This project was inspired on the Radiomics, and provides a GUI with convenient process. FAE was initially developed by East China Normal University and Siemens Healthineers Ltd. If FAE could help in your project, We...

Downloads: 7 This Week

Last Update: 2026-03-15
See Project
18

STRIKER-GUI

Refine the spectral library to enhance its completeness and coverage.

STRIKER is a tool for correcting spectra with missing or incorrect adduct annotations. It also enables efficient construction of an HMDB-based spectral library and extraction of sublibraries from large spectral libraries.

Downloads: 0 This Week

Last Update: 2025-12-02
See Project
19

Auditory Modeling Toolbox

The auditory modeling toolbox (AMT) is a Matlab/Octave toolbox for the development and application of auditory computational models. Over 50 auditory models implemented in Matlab, Octave, C, C++, and Python can be run from Matlab and Octave, on Windows and Linux. The AMT provides a well-structured in-code documentation, includes auditory data required to run the models. It integrates functionality to reproduce the model predictions. Model implementations can be evaluated in two stages,...

3 Reviews

Downloads: 28 This Week

Last Update: 2026-04-03
See Project
20

PoseidonQ - AI/ML Based QSAR Modeling

ML based QSAR Modelling And Translation of Model to Deployable WebApps

- This Software was made with an intention to make QSAR/QSPR development more efficient and reproducible. - Published in ACS, Journal of Chemical Information and Modeling . Link : https://pubs.acs.org/doi/10.1021/acs.jcim.4c02372 - Simple to use and no compromise on essential features necessary to make reliable QSAR models. - From Generating Reliable ML Based QSAR Models to Developing Your Own QSAR WebApp. For any feedback or queries, contact kabeermuzammil614@gmail.com - Available on...

Downloads: 34 This Week

Last Update: 2026-03-26
See Project
21

Pixelyse

A simple tool for scanning and digitalizing your photos effortlessly.

Minimizes the number of steps required to perform scanning and photo extraction, making the process quicker and more efficient. A user-friendly interface and streamlined functionality allows you to go from document to extracted photos in just a few clicks. For more information and troubleshooting, check the readme file: https://sourceforge.net/p/pixelyse/code/ci/master/tree/README.md

Downloads: 0 This Week

Last Update: 2024-07-24
See Project
22

AudiooPy

Audio manager in Python Object-Oriented Programming

... - A scientifically validated method for automatically detecting sound segments in speech. - Manipulation of raw audio data. - Audio mixing capabilities. - Automated computation of statistical descriptors for audio data. - Channel extraction. - Channel mixing. AudiooPy is entirely self-contained and does not rely on any external libraries. <https://img.shields.io/pypi/v/AudiooPy>

Downloads: 1 This Week

Last Update: 2025-10-09
See Project
23

AudioBC

Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS

...Privacy-First & Offline: After a one-time initial model download, all processing happens on your CPU. Your books never leave your computer. Multi-Language Support: Curated voices for English (US & UK), Italian, French, Spanish, and Portuguese (BR). Smart Extraction: Automatically filters out non-narrative cont

Downloads: 3 This Week

Last Update: 2026-03-22
See Project
24

garysfm

An advanced file manager with qss themes and iso and folder previews

garysfm which stands for Gary's File Manager is a file manager with some advanced features. Those features include bulk renaming and folder image previews. I has rather advanced search functions, tab browsing with persistence between launches. It remembers your folder sorting and view options in icon view. It also remembers your active tabs between sessions. It has progress dialog while doing large operations like copying large files, and folders with many files. python version works on...

Downloads: 15 This Week

Last Update: 2025-10-20
See Project
25

DeepKE

An Open Toolkit for Knowledge Graph Extraction and Construction

...Since relations are expressed over multiple sentences in real-world applications, DeepKE supports document-level relation extraction. We present a new open-source and extensible knowledge extraction toolkit, called DeepKE, supporting standard fully supervised, low-resource few-shot and document-level scenarios.

Downloads: 0 This Week

Last Update: 2023-09-21
See Project

Previous
1
2
3
4
You're on page 5
6
7
8
9
Next

Related Searches

mega-voice

labplot

•mobile phone forensics tools

osint

•mobile forensics tools

speech synthesis

.mega-voice

osint tools

android forensics tools

plot

Related Categories

Artificial Intelligence

Scientific/Engineering

Software Development

Multimedia

Internet

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise