Showing 369 open source projects for "file text search"

View related business solutions
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 1

    Text File Generator

    Generate large text files for various testing purposes

    Generate large text files with dummy data (i.e. for performance testing an app that needs to read a file). This is a console app. It allows to generate large files (you can specify the size in megabytes). There is a template which you can set to define how a single line should look like, e.g. you can make a CSV file from it. This can contain random words, random numbers and sequential numbers.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Saladict

    Saladict

    Pop-up dictionary and page translator with multiple search modes

    All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching. Saladict supports mixed use of multiple search modes. Whether it be single-click, double-click, pop-up icon, hovering or keyboard shortcuts, you name it. The combination of dictionaries is dynamically adjusted according to search text and Saladict profile. Although Saladict is built for looking up English words, it also supports many other languages. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 4
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a full-text search index, and finally answer the user question with an LLM agent.
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    ImHex

    ImHex

    A Hex Editor for Reverse Engineers, Programmers

    ...At the same time ImHex is completely free and open source under the GPLv2 license. Byte patching, patch management, Copy-Bytes-As-Array for many common programming languages, string and hex search, Bookmarks, Goto, Huge file support, variable column count, custom encodings and much more. ImHex includes a full custom C++-like pattern language that allows easy highlighting, decoding and analyzing of file formats.
    Downloads: 54 This Week
    Last Update:
    See Project
  • 6
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 7
    Astropy

    Astropy

    Repository for the Astropy core package

    The Astropy Project is a community effort to develop a common core package for Astronomy in Python and foster an ecosystem of interoperable astronomy packages. Astropy is a Python library for use in astronomy. Learn Astropy provides a portal to all of the Astropy educational material through a single dynamically searchable web page. It allows you to filter tutorials by keywords, search for filters, and make search queries in tutorials and documentation simultaneously. The Anaconda Python...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Leiningen

    Leiningen

    Automate Clojure projects easily

    ...If your preferred package manager offers a recent version of Leiningen, try that first as long as it has version 2.x. Windows users can use the above script in the Linux subsystem or try the batch file or Powershell version instead. Most tasks need to be run from somewhere inside a project directory to work, but some (new, help, search, version, and repl) may run from anywhere.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    go1pylib

    go1pylib

    go1pylib is a Python library designed to control the Go1 robot

    go1pylib is a Python library designed to control the Go1 robot by Unitree Robotics. It provides an easy-to-use interface for robot movement, state management, collision avoidance, battery monitoring, and MQTT communication. Ideal for research and robotics development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    go-i18n

    go-i18n

    Translate your Go program into multiple languages

    ...Supports pluralized strings for all 200+ languages in the Unicode Common Locale Data Repository (CLDR). Code and tests are automatically generated from CLDR data. Supports strings with named variables using text/template syntax. Supports message files of any format (e.g. JSON, TOML, YAML). Use goi18n extract to extract all i18n.Message struct literals in Go source files to a message file for translation. Create an empty message file for the language that you want to add (e.g. translate.es.toml). Run goi18n merge active.en.toml translate.es.toml to populate translate.es.toml with the messages to be translated. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AlohaMini

    AlohaMini

    Open-Source Dual-Arm Mobile Robot with Motorized Lift

    AlohaMini is a lightweight self-hostable productivity and note-taking platform designed to help individuals organize thoughts, tasks, and knowledge in a fast, minimal interface that avoids unnecessary complexity. It combines hierarchical notebooks, rich text editing, and tagging, allowing users to structure information the way they think and refine that structure over time. The platform focuses on speed and responsiveness, with a UI that loads instantly and feels smooth even on modest...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Leader badge
    Downloads: 443 This Week
    Last Update:
    See Project
  • 13

    Tokenized Text Aligner

    Aligns tokens in two versions of a text with differing tokenization.

    This tool performs token-by-token alignment of two versions of a text with differing tokenization by interpreting the results of a file diff (https://docs.python.org/3/library/difflib.html). It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CiteFlow

    CiteFlow

    Desktop research workspace for PDFs, notes, citations, bibliographies.

    CiteFlow is a focused desktop research workspace for students, researchers, and academic writers who want to manage PDFs, notes, citations, and bibliographies in one place. Create project-based workspaces for essays, articles, reports, literature reviews, and long-form research. Import PDFs, read them inside the app, search within documents, compare files side by side, highlight key passages, and add page-based notes. CiteFlow can assist with DOI metadata detection, keeps citation history linked to copied text, and generates citations and bibliographies in APA 7 and MLA 9. OCR support helps make scanned PDFs searchable, while project-wide search helps you find matching text across PDFs, highlights, and notes. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    IMS Open Corpus Workbench

    IMS Open Corpus Workbench

    Indexing and query tools for very large text corpora

    The IMS Open Corpus Workbench is a collection of tools for managing and querying large text corpora (100 M words and more) with linguistic annotations. Its central component is the flexible and efficient query processor CQP, which can be used interactively in a terminal session, as a backend e.g. from a Perl script, or through the Web-based GUI CQPweb.
    Leader badge
    Downloads: 68 This Week
    Last Update:
    See Project
  • 16
    NumeRe

    NumeRe

    Framework for numerical computations, data analysis and visualisation

    Curve fitting | Data analysis | Plotting | Matrix operations | FFT | Extensible framework | Multiple file formats | Programmable | Open source | Free for everyone NumeRe: Framework for Numerical Computation is a numerical framework written for Microsoft Windows(R) and released under the GNU GPL v3 for solving and visualizing mathematical and physical problems numerically. Keep simple things simple: You want to plot a sine function? Just enter 'plot sin(x)'. You want to load some...
    Leader badge
    Downloads: 122 This Week
    Last Update:
    See Project
  • 17
    Skim

    Skim

    A PDF Reader and Note-taker for OS X

    Skim is a PDF reader and note-taker for OS X. It is designed to help you read and annotate scientific papers in PDF, but is also great for viewing any PDF file. Skim requires OS X 10.10 or higher.
    Leader badge
    Downloads: 5,064 This Week
    Last Update:
    See Project
  • 18
    YAT

    YAT

    Yet Another Terminal :: Serial Communication :: Engineer/Test/Debug

    Engineering, testing and debugging of serial communication. Supports RS-232/422/423/485 as well as TCP/IP Client/Server/AutoSocket, UDP/IP Client/Server/PairSocket and USB Ser/HID. Optimized for simple command sets of e.g. embedded systems.
    Leader badge
    Downloads: 577 This Week
    Last Update:
    See Project
  • 19
    RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents, sort of a Reference Manager or BibTeX for markup languages. It is portable and known to run on Linux, Free/NetBSD, OSX, Solaris, and Windows/Cygwin.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    Web-based Electronic Laboratory Notebook (ELN) with integrated Chemical Inventory by the group of Prof. Goossen (TU Kaiserslautern, Germany), based on PHP/MySQL. Allows (sub-)structure search, reaction planning, management of spectra and literature.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    GeoNetwork opensource - Metadata Catalog
    A web based Metadata Catalog Server for data description and discovery. Supports both generic and geospatial data discovery. It implements international standards (e.g. ISO19115/19139/19119, ISO19115-3, DCAT-AP, CSW 2.0, OGC API Records). It originates from the United Nations and is used by many governments as geoportal software. Active development and discussion takes place on GitHub and OSGeo Discourse. You can also connect directly with the companies supporting the...
    Leader badge
    Downloads: 164 This Week
    Last Update:
    See Project
  • 22
    LaBB-CAT

    LaBB-CAT

    A linguistic annotation store

    LABB-CAT is a browser-based linguistics research tool that stores recordings and regular-expression searchable text transcripts of interviews. The search results, entire transcripts, and media, can be viewed or exported in a variety of format
    Downloads: 9 This Week
    Last Update:
    See Project
  • 23
    calculatorpp

    calculatorpp

    Calculator++

    Written in C++ (using template). Calculator++ is a application/library for Window (POSIX). The core of code is adaptable for posix. This algorithm support: + built-in operator + built-in functions + run-time functions and variables definition + run-time units system, units with prefix + constants definition + Pretty mathematical expressions - Process ascii files (dev) - vector and matrix (dev) - graph 2D and 3D (dev) - vector graph (dev) ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    JQM Java Quine McCluskey

    JQM Java Quine McCluskey

    JQM - Java Quine McCluskey for minimization of Boolean functions.

    ...Uniquely, JQM bridges the gap between theory and practice: it visualizes the solution process with generated Karnaugh Maps for students, while supporting PLC engineers by exporting results to Structured Text (ST) and Ladder Diagram (LD). The software includes a GUI for managing truth tables and supports diverse file formats for import and export.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB