Showing 65 open source projects for "text processing"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    pg_textsearch

    pg_textsearch

    PostgreSQL extension for BM25 relevance-ranked full-text search

    ...It also supports advanced query features such as ranking, filtering, and linguistic processing. By embedding search capabilities within the database, it simplifies architecture and reduces operational complexity. The project is particularly useful for applications that require fast and accurate text retrieval. Overall, pg_textsearch extends PostgreSQL into a more powerful platform for text-based data exploration.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    GitHub search with Manticore Search

    GitHub search with Manticore Search

    Demo: GitHub search with Manticore Search

    GitHub search with Manticore Search. The Manticore GitHub Issue Search tool allows users to search through GitHub issues using Manticore Search, a powerful full-text search engine designed for large datasets and real-time processing. It integrates Manticore's capabilities with GitHub to offer fast and efficient searches within repositories.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    syslog-ng

    syslog-ng

    Log management solution that improves the performance of SIEM

    syslog-ng is the log management solution that improves the performance of your SIEM solution by reducing the amount and improving the quality of data feeding your SIEM. With syslog-ng Store Box, you can find the answer. Search billions of logs in seconds using full text queries with Boolean operators to pinpoint critical logs. syslog-ng Store Box provides secure, tamper-proof storage and custom reporting to demonstrate compliance. syslog-ng can deliver data from a wide variety of sources to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Leader badge
    Downloads: 382 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Text Line Duplicate Remover

    Text Line Duplicate Remover

    Remove duplicate lines from your text

    This standalone offline web browser tool helps you remove duplicate lines from your text, with additional text processing options. Simply open it in your browser by double-clicking the html file. It also includes the source code too. I made this when I was working with long lists of entries and needed something to automatically clean them up. As a bonus you can also change the Sentence Case of the text, make it lowercase, UPPERCASE or Sentence case.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Swiss File Knife

    Swiss File Knife

    One hundred command line tools in a small and portable binary.

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file,...
    Leader badge
    Downloads: 400 This Week
    Last Update:
    See Project
  • 8
    Script Echo Color

    Script Echo Color

    Terminal text colorizing and simplifies script coding.

    ScriptEchoColor simplifies Linux terminal text colorizing, formatting and several steps of script coding.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    AI File Sorter

    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Leader badge
    Downloads: 516 This Week
    Last Update:
    See Project
  • 11
    TextSeek

    TextSeek

    Professional full-text desktop search tool

    TextSeek is a professional full-text desktop search tool. Unlike the filename search tool like Everything and Listary, TextSeek can search filename and file content easily and quickly. It supports PDF, Word, Excel, Powerpoint, RTF and other formats. The software can run directly, and no extra package is required to install.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    AutoText

    AutoText

    Text expander(auto-type) application

    AutoText is a text expander(auto-typer) application that can minimize time that you spent typing phrases that your often use. You can bind an abbreviation(autotext) to some text snippet, phrase or sentence and that text will be automatically typed by AutoText each time you type the abbreviation. Keyboard keys press/hold/toggle is supported by using macros. AutoText works in any Windows program.
    Leader badge
    Downloads: 58 This Week
    Last Update:
    See Project
  • 13
    FAR - Find And Replace
    Search and replace operations on file content accross multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup and various character encodings. Run grep like extractions to condense or rearrange sources, or perform bulk file renaming.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 14
    A collection of small utilities for: data extraction (text or binary files), data buffering, message queue control, column addition, date/time manipulation, and data recovery testing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Clu-Linux-Live

    Clu-Linux-Live

    Various Processing and Data Rescue Tools over Wired or Wireless Networ

    This Linux Live CD provides Various Processing Command Line Utilities (Clu) and Data Rescue Tools which can be used on a Wired or Wireless Network. On Startup it prompts the user to change password, mount all filesystems available locally, start wireless network ( if wifi interface present ), start network services (samba/ssh/sftp) and present user with a console for executing various utilities i.e Text, Image, Audio, Video, Downloading etc. on their FileSystems that are mounted. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    XMLStarlet is a set of command line utilities (tools) to transform, query, validate, and edit XML documents and files using simple set of shell commands in similar way it is done for text files with UNIX grep, sed, awk, diff, patch, join, etc utilities.
    Leader badge
    Downloads: 1,286 This Week
    Last Update:
    See Project
  • 18
    Arabic Keyboard

    Arabic Keyboard

    Arabic Keyboard Phonetic

    Can't Remember the Position of Arabic Letters on your Keyboard? Try This Phonetic Arabic Keyboard Layout. Letters correspond to Latin / English / German Letters of similar pronunciation, which makes it much faster and easier to type. Example: the Arabic Letter Meem "م" can be typed by pressing "M". They Layout can be either used online on arab-key.com or installed as a System Language for macOS and Windows. Includes Arabic, Persian, Kurdish Letters. All Features Free
    Leader badge
    Downloads: 145 This Week
    Last Update:
    See Project
  • 19
    360-FAAR  Firewall Analysis Audit Repair

    360-FAAR Firewall Analysis Audit Repair

    360-FAAR Analyze FW1 Cisco Netscreen Policy Offline Using Config/Logs

    360-FAAR (Firewall Analysis Audit and Repair) is an offline, command line, firewall policy manipulation tool to filter, compare to logs, merge, translate and output firewall commands for new policies, in Checkpoint dbedit, Cisco ASA or ScreenOS commands, and its one file! Read Policy and Logs for: Checkpoint FW1 (in odumper.csv / logexport format), Netscreen ScreenOS (in get config / syslog format), Cisco ASA (show run / syslog format), 360-FAAR compares firewall policies and uses...
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    text replace case insensitive c# vb.net

    text replace case insensitive c# vb.net

    .NET replace text case insensitive

    Fast algorithm text (string) replace case insensitive writing in .NET. Increase the performance of your application!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    LogDruid

    Generate charts and reports using data gathered in log files

    An application to gather, aggregate, chart and report information originating from any log files. It uses regular expressions that are constructed graphically and can be tested in the application against samples. Once configured for a specific type of log file set, the gathering and display of the chart for a new files set can be done in just one click. Contains a sample template to handle few log types: Java GC log, OpenDS access log, Apache access log
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    PanDocElectron

    Graphical User Interface for PanDoc for Linux, Mac & Windows

    PanDoc Graphical User Interface implemented with Electron for Linux, Mac and Windows. It support users in converting source documents into various other formats like docx, odt, html and reveal documentation. The zip files contain the full source code because PanDocElectron is written in HTML/Javascript. Electron is used more or less as browser that runs the HTML/Javascript application. [Download PanDocElectron](https://sourceforge.net/p/pandocelectron/wiki/Home/) Extract the zip-file...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Nanolinux

    Nanolinux

    Tiny Linux desktop distribution in just 14 MB

    ...It uses Nano-X instead of X11. FLTK 1.3.x is used as the GUI toolkit. The window manager is SLWM. The following applications are included: Dillo browser FlWriter text processing program Sprsht spread sheet program FLTDJ personal information manager FlMusic CD player FlRadio internet radio FlChat IRC client Antipaint paint program Fluff file manager NXterm terminal emulator Flcalc calculator FlView image viewer Fleditor Games: Tuxchess, NXeyes, Checkers, Blocks, Sudoku, Mastermind Mount tool, system statistics Software package download and install utility NTFS support Grub package for hard disk install OSS sound, Truetype fonts, UTF-8 support Nanolinux runs completely from RAM. ...
    Downloads: 53 This Week
    Last Update:
    See Project
  • 24
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Tengwar Writer

    Tengwar Writer provides keyboard layouts to write directly in Tengwar

    Tengwar Writer is NOT a transcripter! It changes the keyboard layout for writing in Tengwar in a easy way. This means, if you type "T" you will get the Tengwar character for T and not something other. You can make your own modes for other languages or just your prefered keyboard layout. With the visual keyboard you can see what Tengwar character is on which key. THIS IS ALPHA!!!
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB