Showing 252 open source projects for "extract java"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    CERCA

    CERCA

    CERCA – Citation Extraction & Reference Checking Assistant

    CERCA is an open-source research tool that supports the verification of bibliographic references in scientific manuscripts. It extracts references from PDF files and checks their existence and consistency against authoritative metadata sources, producing explainable diagnostics, audit logs, and reproducible reports. It is intended for: - Researchers performing final manuscript checks - Reviewers assessing reference consistency - Editors supporting editorial quality control -...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    PDF Tinkerer

    PDF Tinkerer

    Tinker with PDF files

    Tinker with PDF files. Download the JAR file for your OS (e.g. Windows) and double click on it. You will need at least Java 21 (e.g. https://adoptium.net/temurin/releases/?os=any&arch=any&version=21) to run this Desktop-App. The latest releases of PDF Tinkerer can now be found on: https://gitlab.com/gjwu/pdf-tinkerer/-/releases
    Downloads: 1 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Stipple Effect

    Stipple Effect

    A pixel art editor that supports animation and scripting

    Stipple Effect is a pixel art editor that supports animation and scripting. It is designed to facilitate a variety of workflows and to encourage rapid, iterative creation of video game art assets and other types of artwork. Stipple Effect was made by a solo indie game developer with the needs and skill sets of other indie developers in mind. It is lightweight and simple to learn and use, yet allows for considerable depth and complexity. Stipple Effect has a feature-rich scripting API that...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    Orbit

    Orbit

    ORBIT : Operating Business Intelligence Tool

    ORBIT : Operating Business Intelligence Tool Making Data Accessible Through Centralized Database Access ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    MyBox

    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    YouTube Downloader ArmoMan

    YouTube Downloader ArmoMan

    An android application to download audio (mp3) from YouTube videos.

    An android application to download audio (mp3) from YouTube videos and from playlists. YouTube Downloader ArmoMan is an intuitive Android application designed to empower users to effortlessly extract high-quality MP3 audio from YouTube videos and playlists. With a user-friendly interface and a host of convenient features, this app simplifies the process of downloading and enjoying your favorite YouTube content in audio format. Download from SourceForge or GitHub. GitHub download...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 9
    PoJamas aims to provide a Python and tools for loading, processing, and producing .cr2, pz3 (crz, pzz) files compatible with the SmithMicro (e-frontier) Poser character animation application. PoJamas is composed of: - Python library - Python Wavefront (.obj) 3D viewer based on GLFW - LibreOffice/Python Application (to ease the library and the viewer usage) As of 2020, the project is ported in Python3 As of 2021 this project proposes a 3D viewer for Wavefront files...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    Prime QA

    Prime QA

    State-of-the-art Multilingual Question Answering research

    PrimeQA is a public open source repository that enables researchers and developers to train state-of-the-art models for question answering (QA). By using PrimeQA, a researcher can replicate the experiments outlined in a paper published in the latest NLP conference while also enjoying the capability to download pre-trained models (from an online repository) and run them on their own custom data. PrimeQA is built on top of the Transformers toolkit and uses datasets and models that are directly...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    JSLT

    JSLT

    JSON query and transformation language

    JSLT is a complete query and transformation language for JSON. The language design is inspired by jq, XPath, and XQuery.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Launch4j Executable Wrapper
    Cross-platform Java executable wrapper for creating lightweight Windows native EXEs. Provides advanced JRE search, application startup configuration and better user experience.
    Leader badge
    Downloads: 3,118 This Week
    Last Update:
    See Project
  • 13
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby Database - Written in Java Cross Platform See also Free Email Sender in this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    7-Zip-JBinding

    7-Zip-JBinding

    Java wrapper for 7z archiver engine

    Native (JNI) cross-platform library to extract (password protected, multi-part) 7z Zip Rar Tar Split Lzma Iso HFS GZip Cpio BZip2 Z Arj Chm Lhz Cab Nsis Deb Rpm Wim Udf archives and create 7z, Zip, Tar, GZip & BZip2 from Java.
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 15

    JavaTranslator

    Utility for Java program internationalization

    An useful application to help in java applications translation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PDFLayoutTextStripper

    PDFLayoutTextStripper

    Converts a pdf file into a text file while keeping the layout

    Converts a PDF file into a text file while keeping the layout of the original PDF. Useful to extract the content from a table or a form in a PDF file. PDFLayoutTextStripper is a subclass of PDFTextStripper class (from the Apache PDFBox library).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    QuickNoteCLI

    QuickNoteCLI

    QNC is a command line interface app for creating quick notes

    usage: QNC -a,--append Append text to the last note -c,--clear Clear all notes -d,--delete <arg> Delete note by index or name -e,--erase Erase last note -h,--help Print help -l,--list Print note list -n,--name <arg> Specify note name -N,--nano Open in nano editor (If installed) -p,--print Print last note text -r,--rename <arg> Rename last note -s,--show <arg> Show note by name -S,--dbs Start DB server On Windows you can also use WIN + R, qnc <args> Manual start: java -jar QNC-0.1.0.jar Installation with scripts: Extract files from zip to any directory WINDOWS: run install.bat UNIX: run install.sh or execute command in terminal: alias qnc="java -jar /path/to/jar/QNC-0.1.0.jar" https://github.com/DeMmAge/QuickNoteCLI
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Story Inspector

    Story Inspector

    Book analysis tool using AI.

    Story Inspector is a new book analysis tool that uses Artificial Intelligence to automatically extract information about works of fiction. It provides authors with insights about their manuscripts in order to improve their craft. It is published under the GPLv2 license. The source code is available at: https://github.com/taciano-perez/story-inspector
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    QToaster

    A simple and fast frontend for the QEMU Emulator - Freeware.

    ...If something works on QEMU then it has a good chance of working on Android. You don't need to put QEMU in your environment path, just point to its base folder. It doesn't need any special runtimes (java, .net, Python, etc.), it only needs the utility file Lutil.dll. Frontend was made for: http://69.170.215.148/nothoro/software/QTPage/QToaster.html QEMU emulator version 6.0.0-rc2 available at: https://qemu.weilnetz.de/w64/2021/ When you run the installer, it will extract to a subfld
    Downloads: 12 This Week
    Last Update:
    See Project
  • 20
    Marathon -GUI Test Runner Web, Swing, FX

    Marathon -GUI Test Runner Web, Swing, FX

    Marathon supports testing of Java/Swing and Java/Fx applications.

    Marathon provides an integrated environment for test script creation and execution. Supported FW: Web, Java Swing/ Java FX. Currently, Marathon supports JRuby script models for recording the test scripts.Marathon test runner generates Allure test reports.Marathon allows for grouping of test cases.It also has an option of inserting modules while recording(in MarathonITE while recording). It allows tester to inset checklist while recording and also takes screen capture and annotate it in...
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    RapidMiner -- Data Mining, ETL, OLAP, BI
    ETL, data warehousing, data mining, OLAP, business intelligence (BI) in Java. 500+ modules: extract, transform, load (ETL), data mining, data analysis + Weka, statistical forecasting, preprocessing, validation, visualization, OLAP, business intelligence.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 22
    Mojito

    Mojito

    An automation platform that enables continuous localization

    Mojito is a continuous localization platform. Rely on continuous integration to collect all of your software strings in one place. Check what products need localization in real-time. Create and import translation packages with a single click. Search and edit translations across all products and languages! And if you have a small dedicated translation team, they can work directly in Mojito.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    xml2csv.jar

    Command-line xml to csv tool

    ...Convert all xml nodes to csv java -jar xml2csv-1.0.jar test/cd_catalog.xml 4. Convert xml to csv, like extract only nodes /CATALOG/CD/TITLE, /CATALOG/CD/ARTIST, /CATALOG/CD/PRICE in format PostgreSQLCsv java -jar xml2csv-1.0.jar -f PostgreSQLCsv -o '/CATALOG/CD/TITLE, /CATALOG/CD/ARTIST, /CATALOG/CD/PRICE' test/cd_catalog.xml
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    BinlogMiner is a Open Source parser Library and Tools for MySQL binlog, Which can easy to decode MySQL binlog event's contents and flashback your table/database by generated executable UNDO/REDO statements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TIES

    TIES

    A smart search engine for medical documents

    TIES (Text Information Extraction System) is a clinical text search engine that uses Natural Language Processing techniques to extract medical concepts from free text clinical reports. It provides secure de-identified access to this information and has in built collaboration tools and honest broker functionality. It is licensed for academic use under the BSD license. For commercial use please contact Nexi at http://nexihub.com *** NOTICE: this software and forum are no longer...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB