Showing 176 open source projects for "data processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Xi Editor

    Xi Editor

    A modern editor with a backend written in Rust

    ...Its architecture splits a thin UI layer from a high-performance core engine (written in Rust) that handles buffer editing, syntax highlighting, undo/redo, searching, and background processing asynchronously. This separation lets the core focus on speed and concurrency, while multiple frontends (macOS, GTK, web) handle rendering, input, and platform bridging. Xi is designed to scale to huge files (gigabyte scale) without lag, thanks to a rope data structure and efficient diffing algorithms. Plugins communicate with the core via JSON-RPC, enabling asynchronous extensions without blocking editing responsiveness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    fastText

    fastText

    Library for fast text classification and representation

    FastText is an open-source, free, lightweight library that allows users to learn text representations and text classifiers. It works on standard, generic hardware. Models can later be reduced in size to even fit on mobile devices. ext classification is a core problem to many applications, like spam detection, sentiment analysis or smart replies. In this tutorial, we describe how to build a text classifier with the fastText tool. The goal of text classification is to assign documents (such as...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    XSLT syntax highlighting

    XSLT syntax highlighting

    Java based XSLT Processor extension for syntax highlighting

    Please note that project moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon), so if you have e.g. article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
    Leader badge
    Downloads: 191 This Week
    Last Update:
    See Project
  • 7
    Fen2eps is a small console program that converts FEN (Forsyth Edwards Notation) strings to EPS (Encapsulated Postscript) files containing the chess board diagram.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 185 This Week
    Last Update:
    See Project
  • 9
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 56 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extensible in Java to display data from any data sources (as database). Can generate complete homepages (tree of HTML-s, images, etc.)
    Leader badge
    Downloads: 84 This Week
    Last Update:
    See Project
  • 11
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project
  • 12
    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.
    Leader badge
    Downloads: 93 This Week
    Last Update:
    See Project
  • 13
    dvi2bitmap is a utility to convert TeX DVI files directly to bitmaps, without going through the complicated (and slow!) route of conversion via PostScript and PNM. The prime motivation for this is to prepare mathematical equations for inclusion in HTML files, but there is a broad range of uses beyond that. dvi2bitmap... * is written in portable C++, and the program acts as a wrapper round the libdvi2bitmap library (both static and shareable), which abstracts DVI and PK files and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    LogDruid

    Generate charts and reports using data gathered in log files

    An application to gather, aggregate, chart and report information originating from any log files. It uses regular expressions that are constructed graphically and can be tested in the application against samples. Once configured for a specific type of log file set, the gathering and display of the chart for a new files set can be done in just one click. Contains a sample template to handle few log types: Java GC log, OpenDS access log, Apache access log
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LaTeX Web Publisher

    LaTeX Web Publisher

    LaTeX Web Publisher is a Makefile based Web publishing system

    LaTeX Web Publisher is a Makefile based Web publishing system featuring content creation into HTML, non-split HTML, HTML Zip, PDF, DjVu, PostScript, DVI and Plain text formats. All LaTeX Web Publisher output formats are from a single LaTeX source and have indices. LaTeX Web Publisher can be used for website creation and has FTP deployment capabilities. A website created with LaTeX Web Publisher will have HTML, non-split HTML and PDF content formats. The website will have complete HTML...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd allows to edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...) create, delete, edit text nodes, modify their container box by mouse, and run a spellchecker. The program does not directly read the DjVu files, it requires exported XML text data and images. When using without a webserver, you can open and save local files, but cannot take advantages of auto-save and spell checking. Note that current SVN...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    IMPORTANT NOTE: This project has moved to Github: https://github.com/pkozelka/libxml2-pas Pascal units accessing the popular XML API from Daniel Veillard ( http://www.xmlsoft.org ). This should be usable at least from Kylix and Delphi, but hopefully also from other Pascal compilers (like freepascal).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    nlconvert

    nlconvert

    Convert newlines in text files

    This package consists of six command-line utilities that let you convert the newlines of a text file from one style to another. It currently supports CR/LF (DOS/Windows), CR (Macintosh) and LF (Unix) newlines. It doesn’t try to be “intelligent”, since that will almost always fail at some point.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    MindRaider

    MindRaider is a personal notebook and outliner.

    MindRaider is a personal notebook and outliner. Where do you keep private remarks like ideas, plans, gift tips and howtos? Loads of documents and remarks spread around the file system? Can you find a remark when you need it? No? Try MindRaider!
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    TemplateFx

    TemplateFx

    Dynamic Templating Tool

    MOVED TO GITHUB - http://templatefx.netnix.org TemplateFx is a Dynamic, Scriptable, Lightweight and Platform Independent Templating Tool which allows you to generate outputs based on a template and some source data. Its main use is in generating configurations for CLI based devices like Cisco routers and switches.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Jaxe
    Jaxe is a free Java XML editor with a configurable GUI, using XML schemas for validation and XSL for exports in HTML or XML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    This is an Eclipse xml editor with several edition capabilities. The main features concern the interaction with the classes and resources declared in xml (Open class/resource, Create class), similar to the interaction between classes in java editor.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25

    Text Analyzer

    Text analyzing software

    An application developed in C using the list and the AVL tree data structures, which analyzes a text (.txt file) giving the following information as an output: 1. the total occurrences of every word in the text 2. the exact line of every occurrence of every word 3. the exact position in the line of every occurrence of every word 4. the exact paragraph of every occurrence of every word 5. the exact sentence of every occurrence of every word The output is also written in a...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB