Showing 316 open source projects for "data processing"

  • 1
    Roarr

    JSON logger for Node.js and browser

    A transport in most logging libraries is something that runs in-process to perform some operation with the finalized log line. For example, a transport might send the log line to a standard syslog server after processing and reformatting it. Roarr does not support in-process transports because Node processes are single-threaded (ignoring some technical details). Given this restriction, Roarr purposefully offloads handling of the...
    Downloads: 4 This Week
  • 2
    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 10 This Week
  • 3
    StringZilla

    10x faster string search, split, sort, and shuffle for long strings

    StringZilla is the Godzilla of string libraries, splitting, sorting, and shuffling large textual datasets. StringZilla uses a heuristic so simple it's almost stupid... but it works. It matches the first few letters of words with hyper-scalar code to achieve memcpy speeds. The implementation fits into a single C99 header file and uses different SIMD flavors, with SWAR on older platforms. Str is designed to replace long Python str strings and wrap our C-level API. On the other hand, the...
    Downloads: 10 This Week
  • 4
    JC

    CLI tool and Python library

    ...This allows piping of output to tools like jq and simplifying automation scripts. jc JSONifies the output of many CLI tools and file types for easier parsing in scripts. This allows further command-line processing of output with tools like jq or jello by piping commands. The jc parsers can also be used as Python modules. In this case, the output will be a Python dictionary, or a list of dictionaries, instead of JSON. Two representations of the data are available. The default representation uses a strict schema per parser and converts known numbers to int/float JSON values. ...
    Downloads: 4 This Week
  • 5
    Cocur Slugify

    Converts a string to a slug. Includes integrations for Symfony

    Slugify is a PHP library that converts strings into URL-friendly slugs. It replaces spaces and special characters with hyphens or other specified separators, making it ideal for generating SEO-friendly URLs. Slugify is lightweight, fast, and highly configurable, supporting custom rules and language-specific transliterations for accurate slug creation.
    Downloads: 1 This Week
  • 6
    Format Converter - Intel HEX, Elf32, BIN

    File Format Converter Between HEX / BIN / ELF

    Intel HEX file processing software: a file format converter between HEX, BIN, and ELF. The source code was created and debugged in the eS development environment (for Linux and Windows) and is an example of its use. https://esolution.sourceforge.io/
    Downloads: 22 This Week
  • 7
    HCL

    HCL is the HashiCorp configuration language

    HCL is a toolkit for creating structured configuration languages that are both human- and machine-friendly, for use with command-line tools. Although intended to be generally useful, it is primarily targeted toward DevOps tools, servers, etc. HCL has both a native syntax, intended to be pleasant to read and write for humans, and a JSON-based variant that is easier for machines to generate and parse. It includes an expression syntax that allows basic inline computation and, with support from...
    Downloads: 5 This Week
  • 8
    mp-html

    Mini program rich text component that supports rendering and editing HTML

    A powerful rich text component for mini programs (applets). It supports rendering and editing HTML and can be used on the WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a requirement for many applications, but the mini program platform does not support DOM operations, which makes this difficult. The built-in rich-text component supports few tags and blocks all events, which limits its practical use. Therefore, there is such a...
    Downloads: 2 This Week
  • 9
    XLSX

    Go (golang) library for reading and writing XLSX files

    ...These can be used to modify the behavior of the resultant struct. In particular, they replace the `…WithRowLimit` variants of those methods with the result of calling `xlsx.RowLimit`, and they add the ability to define a custom backing store to hold the spreadsheet data during processing. The full API docs can be viewed using Go's built-in documentation tool.
    Downloads: 0 This Week
  • 10
    Emogrifier

    Converts CSS styles into inline style attributes in your HTML code

    Emogrifier converts CSS styles into inline style attributes in your HTML code. This ensures proper display on email and mobile device readers that lack stylesheet support. This utility was developed as part of Intervals to deal with the problems posed by certain email clients (namely Outlook 2007 and GoogleMail) when it comes to the way they handle styling contained in HTML emails. As many web developers and designers already know, certain email clients are notorious for their lack of CSS...
    Downloads: 1 This Week
  • 11
    Gwyddion

    Scanning probe microscopy data visualisation and analysis

    A data visualization and processing tool for scanning probe microscopy (SPM, i.e. AFM, STM, MFM, SNOM/NSOM, ...) and profilometry data, useful also for general image and 2D data analysis.
    Downloads: 1,699 This Week
  • 12
    SingleFile

    Web Extension for saving a copy of complete web page in a single file

    Web Extension for Firefox/Chrome/MS Edge and CLI tool to save a faithful copy of an entire web page in a single HTML file. SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex Browser, and Opera. It helps you to save a complete web page into a single HTML file. Wait until the page is fully loaded. Click on the SingleFile button in the extension toolbar to save the page. You can click again on the...
    Downloads: 13 This Week
  • 13
    BibDesk

    Bibliography manager for Mac OS X

    BibDesk is a graphical BibTeX bibliography manager for Mac OS X.
    Downloads: 2,751 This Week
  • 14
    biber
    Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports an unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.
    Downloads: 401 This Week
  • 15
    XML Copy Editor
    XML Copy Editor is a fast, free, validating XML editor.
    Downloads: 945 This Week
  • 16
    LaTeXML

    A TeX and LaTeX to XML/HTML/ePub/MathML translator

    LaTeXML is a tool that converts LaTeX documents into structured formats like HTML, MathML, and ePub. Unlike traditional TeX-to-PDF processors, LaTeXML preserves semantic content, making it suitable for web publishing, accessibility, and content reuse. It supports a wide range of LaTeX packages and is designed to enable high-quality rendering of mathematical and scientific documents.
    Downloads: 2 This Week
  • 17
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well-proven XML and text processing technologies to easily extract useful data from arbitrary web pages.
    Downloads: 3 This Week
  • 18
    biblatex
    Biblatex is a LaTeX package which provides full-featured bibliographic facilities
    Downloads: 74 This Week
  • 19
    Text Encoding Initiative

    TEI produces the TEI Guidelines and associated software

    The TEI is an international and interdisciplinary standard used by libraries, museums, publishers, and academics to represent all kinds of literary and linguistic texts, using an encoding scheme that is maximally expressive and minimally obsolescent.
    Downloads: 13 This Week
  • 20
    Microsoft Works format import library
    libwps is a Microsoft Works file format import filter built on top of librevenge (see https://sourceforge.net/p/libwpd/wiki/librevenge/). Currently, libwps can import all word processing Works formats since about 1995 with some success. It may also be able to import some basic database and spreadsheet files.
    Downloads: 365 This Week
  • 21
    ant4docbook

    ANT4DOCBOOK is an ANT task for DOCBOOK

    ANT4DOCBOOK is an ANT task for DOCBOOK, a semantic markup language for technical documentation.
    Downloads: 1 This Week
  • 22
    asammdf

    Fast Python reader and editor for ASAM MDF / MF4 (Measurement Data Format)

    asammdf is a fast Python parser and editor for ASAM (Association for Standardisation of Automation and Measuring Systems) MDF / MF4 (Measurement Data Format) files. It supports MDF versions 2 (.dat), 3 (.mdf) and 4 (.mf4). asammdf works on Python 2.7 and Python >= 3.4.
    Downloads: 105 This Week
  • 23
    Biosignal Tools
    BioSig is a software library for processing of biomedical signals (EEG, ECG, etc.) with Matlab, Octave, C/C++ and Python. About 50 different data formats are supported.
    Downloads: 190 This Week
  • 24
    TextExtractor

    Extracts plain text from a variety of different file types

    TextExtractor extracts plain text from hundreds of different file types, storing the extracted text in suitably named text files. TextExtractor 1.10 works in six different modes. Instant Mode: select any file and extract the text from it. Batch Mode: select a group of files and extract the text from all of them in one go. Polling Mode: watch a folder location, processing new files as they appear there. Hierarchical Mode: extract text from files in a directory...
    Downloads: 10 This Week
  • 25
    gSOAP Toolkit

    Development toolkit for Web Services and XML data bindings for C & C++

    The gSOAP toolkit is an extensive suite of portable C and C++ software to develop XML Web services with powerful type-safe XML data bindings. Easy-to-use code-generator tools allow you to directly integrate XML data in C and C++. Serializes native application data in XML. Includes WSDL/XSD schema binding and auto-coding tools, stub/skeleton compiler, Web server integration with Apache module and IIS extension, high-performance XML processing with schema validation, fast MIME/MTOM streaming, SOAP and REST Web API development, WS-* protocols (WS-Security, WS-Policy, WS-ReliableMessaging, etc), XML-RPC and JSON. ...
    Downloads: 713 This Week