Showing 39 open source projects for "self-extracing"

View related business solutions
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 1
    BentoPDF

    BentoPDF

    A Privacy First PDF Toolkit

    BentoPDF is a self-hosted, open-source PDF toolkit that provides a suite of local PDF manipulation features for users who want full control over their documents without relying on cloud PDF services. It offers functionality to merge, split, compress, rotate, and convert PDFs through an easy-to-deploy container or local installation, making it ideal for individuals and teams that handle large volumes of PDF files regularly.
    Downloads: 257 This Week
    Last Update:
    See Project
  • 2
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    ...This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    LittleLink

    LittleLink

    A lightweight DIY alternative to services like Linktree

    The DIY self-hosted LinkTree alternative. LittleLink has more than 60 branded button styles you can easily use, with more being added by our community all the time. LittleLink has more than 60 branded button styles you can use (with even more being added by our community). You'll also find a light and dark theme ready to go. Not a fan of the default colors?
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    latexcv

    latexcv

    A collection of cv and resume templates written in LaTeX

    A collection of user-friendly LaTeX CV and résumé templates (packaged within the R Markdown vitae ecosystem), offering simple themes and templates for creating professional CVs without heavy TeX coding. Supports multiple display themes such as classic, modern, sidebar layouts.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    fmt

    fmt

    Formatting library as an alternative to C stdio and C++ iostreams

    {fmt} is an open-source formatting library providing a fast and safe alternative to C stdio and C++ iostreams. The format API is similar in spirit to the C printf family of function but is safer, simpler and several times faster than common standard library implementations. The format string syntax is similar to the one used by str.format in Python. The library is fully type safe, automatic memory management prevents buffer overflow, errors in format strings are reported using exceptions or...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    JBake

    JBake

    Java based open source static site/blog generator for developers

    JBake is a Java-based, open source, static site/blog generator for developers & designers. The project uses Gradle 4.9+ as the build system. We configured the gradle check style Plugin to run with the check task. It does not break the build if convention violations are found. But prints a warning and generates a report. Source available on GitHub, licensed under MIT License. Binary distribution contains everything you need apart from a JRE, with no complicated environment setup with 3rd...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ECMAScript

    ECMAScript

    Status, process, and documents for ECMA-262

    The most accurate and up-to-date ECMAScript specification. It contains the content of the most recent yearly snapshot plus any finished proposals (those that have reached Stage 4 in the proposal process and thus are implemented in several implementations and will be in the next practical revision) since that snapshot was taken. This Ecma Standard defines the ECMAScript 2022 Language. It is the twelfth edition of the ECMAScript Language Specification. Since publication of the first edition in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    ujson

    A tiny C++ JSON reader with a very simple API

    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    adx - addressbook.xml

    adx - addressbook.xml

    Minimalistic address book in web browser. No server or plugin needed.

    Minimalistic but full-featured addressbook in your web browser. adx is a standalone and portable web app (online and offline). FEATURES Contact Management, portable, small (~350KB), lightweight, contact tagging, geo mapping, web accounts, trigger phone/Skype calls, etc. EXPORT FUNCTIONALITY vCard (as file or QR code via offline generator) HOW IT WORKS Your address-book (XML file) is transformed in your web browser (via XSLT) to a full-featured web application...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    showyourwork

    showyourwork

    A workflow for reproducible and open scientific articles

    ...If you want to make your research article reproducible, extensible, transparent, or just downright awesome, you’ve found the right tool. showyourwork automates your entire workflow, packaging it into a self-contained recipe that anyone can follow and, at the click of a button, use to reproduce your results.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    JSONlite

    JSONlite

    Self-contained, serverless, zero-configuration, json document store

    A simple, self-contained, serverless, zero-configuration, json document store. JSONlite sandboxes the current working directory similar to SQLite. The JSONlite data directory by default is named jsonlite.data and each json document is validated and saved pretty printed as a uuid.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Code Guide

    Code Guide

    Standards for developing consisten and sustainable HTML and CSS

    ...Use soft tabs with two spaces, they're the only way to guarantee code renders the same in any environment. Nested elements should be indented once (two spaces). Always use double quotes, never single quotes, on attributes. Don't include a trailing slash in self-closing elements, the HTML5 spec says they're optional. Don’t omit optional closing tags (e.g. </li> or </body>). Enforce standards mode and more consistent rendering in every browser possible with this simple doctype at the beginning of every HTML page. Authors are encouraged to specify a lang attribute on the root html element, giving the document's language. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    backslide

    backslide

    CLI tool for making HTML presentations with Remark.js using Markdown

    CLI tool for making HTML presentations with Remark.js using Markdown. Use bs init to create a new presentation along with a template directory in the current directory. The template directory is needed for backslide to transform your Markdown files into HTML presentations. You can create as many markdown presentations as you want in the directory, they will all be based on the same template. Use bs serve to start a development server with live reload. A page will automatically open in your...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    tikzcd-editor

    tikzcd-editor

    A simple visual editor for creating commutative diagrams

    A simple visual editor for creating commutative diagrams.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    xmlfy

    xmlfy

    Convert to XML on the fly

    xmlfy converts text/UTF based output into XML formatted output using schema files and/or options to control its behaviour. By Arthur Gouros.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    OPC2MongoDB

    Free tool to read data from OPC sources and save to a MongoDB database

    This tool allows to connect to OPC UA and DA servers, poll for data and subscribe to events and write data in real time to a MongoDB database. It is simple to configure, the opc2mongodb.conf file is self explained, it must be put in the same folder as the exe file. The code is written in C# and it uses the h-OPC and the oficial MongoDB libraries for C#. Requires the .NET fremework 4.6 or later. Need any help? Create an issue here or contact me. Here is my LinkedIn contact: https://www.linkedin.com/in/ricardo-olsen/. Source code: https://github.com/riclolsen/OPC2MongoDB
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    libdropbox

    Small ANSI C lib for dropbox/windows azure communication

    Small ANSI C lib for dropbox and windows azure communication. Built for small platforms. Using PolarSSL for https communication. Features a small self contained https module and a modified version of the JSMN json parser. Originally based on the dropbox_uploader script. Able to do most dropbox actions. Eg. Upload file, download file, list, file info, account info, share link. Also contains a small CLI programs that interfaces with the lib. Also capable of windows azure service bus communications. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Flex Layout Attribute (FLA)

    Flex Layout Attribute (FLA)

    HTML layout helper based on CSS flexbox specification

    ...Layout, use it on the container (parent) element to set layout of its children elements. And self, use it on container/child element to set size or align for the element itself.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    XML2CSV-Generic-Converter

    XML2CSV-Generic-Converter

    Flatten XML into CSV to suit your mood

    ...I made it in order to extract data from big XML files and gather them in files more easily opened with a spreadsheet because I didn't find anything adapted to my needs over the Internet when I needed to (Java + truly generic + self-contained algorithm + Unix like command line options + efficiency). It is packaged as an auto executable Jar for convenient command line execution but might as well be interfaced directly by a Java class as part of a broader [yet non commercial] software. It handles attributes, repeated elements, and so on, and produces results which level up with what spreadsheets generate when they import native XML (at least in its most extensive execution mode). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    QAL

    QAL

    Query Abstraction Layer

    ...It has a database abstraction layer that supports connectivity to Postgres, MySQL, DB2, Oracle, MS SQL server. JSON and MongoDB is coming. It uses XML/JSON formats(self-generated SQL schemas) for representing queries, transformation and merging, making it scriptable. This means that QAL can be backend agnostic about a subset of SQL features and data types. Of course custom SQL:s are also supported. It is currently distributed as a Python 3 Library (pip3 install python3-qal) and Debian .deb package. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    BLUI

    BLUI

    Rich HTML UI engine for UE4

    Rich HTML UI engine for UE4. A user widget (UMG) blueprint which embeds a BLU texture as an image brush. Contains various utility functions to allow loading and parsing urls easily. Call InitBluEye with starting url and browser window size to start this widget. See BluiWorldWidgetActorExample for an example of how it's used in practice. Encapsulated BluiWidget user widget in an actor. Drag and drop this actor into your scene and it will auto-spawn the required BluTickActor to make everything...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    FTF Library - C# / WPF

    FTF Library - C# / WPF

    Flexible Tagged Format Library - C# / WPF version

    Flexible Tagged Format (FTF) is a completely flexible storage format using human readable tags, and potentially support storing any type of data. This is the C# / WPF version.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Flexible Tagged Format (FTF) Library

    Flexible Tagged Format (FTF) Library

    Flexible Tagged Format (FTF) Library

    Flexible Tagged Format (FTF) is a completely flexible storage format using human readable tags, and potentially support storing any type of data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next