Showing 320 open source projects for "text based"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.
    Downloads: 67 This Week
    Last Update:
    See Project
  • 2
    MinDoc

    MinDoc

    Document management system developed for the IT team

    MinDoc is a simple and easy-to-use document management system developed for the IT team. MinDoc's predecessor was the SmartWiki document system. SmartWiki is a document management system developed based on the PHP framework laravel. Because the deployment of PHP is too complicated for ordinary users, it was developed instead of Golang. It can facilitate user deployment and practicality. The starting point for development is that the company's IT department needs a simple and practical system...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    npm-pdfreader

    npm-pdfreader

    Parse text and tables from PDF files.

    npm-pdfreader is a Node.js library for reading text and parsing tables from PDF files. It supports tabular data with automatic column detection and rule-based parsing, making it useful for extracting structured data from PDFs. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PDF4QT

    PDF4QT

    Open source PDF editor

    ...Basic browsing and lots of other functionalities, such as encryption, reading a document, verification of digital signatures, editing of annotations, searching for text using regular expressions, drawing pages into an image, and much more. Several plug-ins are available.
    Downloads: 56 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    OpenMed

    OpenMed

    Open source healthcare AI

    OpenMed is an open-source healthcare AI and medical NLP toolkit designed to turn clinical text into structured insights using transformer-based models and production-oriented interfaces. Its core purpose is to provide specialized medical entity extraction, PII detection and de-identification, assertion-aware analysis, and related healthcare text processing capabilities without locking users into a proprietary platform. The project includes a curated registry of more than a dozen medical NER models focused on areas such as diseases, drugs, anatomy, genes, and protected health information, and it is built to support both research and deployment scenarios. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    EtherCalc

    EtherCalc

    Open source web spreadsheet

    EtherCalc is an open source, web-based spreadsheet that allows multiple users to collaborate and work remotely and in real time. Users can edit the same spreadsheet at the same time, with changes instantly reflected on screen. It accepts various mathematical formulae and other functions, and supports several default text formatting directives as well. It can be used for just about anything you’d need a desktop spreadsheet for-- accounting, inventory, scheduling, list management and so much more. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    kb

    kb

    A minimalist command line knowledge base manager

    kb is a minimalist command-line knowledge base manager that gives users a fast, organized way to collect, store, search, and retrieve notes, documents, cheatsheets, procedures, and other artifacts directly from the terminal. It was created to solve the common problem of having scattered text files or reference materials on disk that are hard to search or categorize, and it surfaces a simple CLI interface with intuitive commands for adding, viewing, editing, and deleting knowledge items. Each entry in kb can be tagged, categorized, given metadata like author or status, and inspected with full-text search or regex-based grepping, helping users quickly find content even across large knowledge collections. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    ...OCR technology is vital part of Papermerge. It extracts text information from scanned documents, PDF, JPEG, TIFF files.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Vespa

    Vespa

    The open big data serving engine

    Make AI-driven decisions using your data, in real-time. At any scale, with unbeatable performance. Vespa is a full-featured text search engine and supports both regular text search and fast approximate vector search (ANN). This makes it easy to create high-performing search applications at any scale, whether you want to use traditional techniques or a modern vector-based approach. You can even combine both approaches efficiently in the same query, something no other engine can do. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    CryptPad
    CryptPad is an open-source, end-to-end encrypted collaborative office suite developed by XWiki SAS. It offers a privacy-focused alternative to mainstream cloud-based productivity tools, enabling users to create, edit, and share documents securely without compromising data privacy. All content is encrypted client-side, ensuring that only authorized users can access the information. CryptPad supports various applications, including rich text documents, spreadsheets, presentations, code editing, kanban boards, polls, and whiteboards, facilitating real-time collaboration among teams. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    HyperTools

    HyperTools

    A Python toolbox for gaining geometric insights

    ...Simple API for customizing plot styles. Set of powerful data manipulation tools including hyperalignment, k-means clustering, normalizing and more. Support for lists of Numpy arrays, Pandas dataframes, text or (mixed) lists. Applying topic models and other text vectorization methods to text data. HyperTools is designed to facilitate dimensionality reduction-based visual explorations of high-dimensional data. The basic pipeline is to feed in a high-dimensional dataset (or a series of high-dimensional datasets) and, in a single function call, reduce the dimensionality of the dataset(s) and create a plot.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Automa.jl

    Automa.jl

    A julia code generator for regular expressions

    Automa is a regex-to-Julia compiler. By compiling regex to Julia code in the form of Expr objects, Automa provides facilities to create efficient and robust regex-based lexers, tokenizers and parsers using Julia's metaprogramming capabilities. You can view Automa as a regex engine that can insert arbitrary Julia code into its input-matching process, which will be executed when certain parts of the regex match an input.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Loro

    Loro

    Make your JSON data collaborative and version-controlled with CRDTs

    loro is a high-performance CRDT (Conflict-free Replicated Data Type) engine designed for building collaborative applications that sync in real time across multiple peers or devices. Written in Rust, loro is designed to be compact, fast, and embeddable in a wide range of environments, from desktop to mobile to web via WebAssembly. Its architecture supports multiple data types like text, maps, and lists, and it offers automatic conflict resolution with minimal data overhead. It is ideal for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    Asciidoctor PDF

    Asciidoctor PDF

    Asciidoctor PDF: A native PDF converter for AsciiDoc

    A fast text processor & publishing toolchain for converting AsciiDoc to HTML5, DocBook & more. Asciidoctor is a fast, open source, Ruby-based text processor for parsing AsciiDoc® into a document model and converting it to output formats such as HTML 5, DocBook 5, manual pages, PDF, EPUB 3, and other formats. Asciidoctor also has an ecosystem of extensions, converters, build plugins, and tools to help you author and publish content written in AsciiDoc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OrientDB

    OrientDB

    DBMS supporting graph, document, full-text and geospatial models

    OrientDB is an Open Source Multi-Model NoSQL DBMS with the support of Native Graphs, Documents, Full-Text search, Reactivity, Geo-Spatial and Object Oriented concepts. It's written in Java and it's amazingly fast. No expensive run-time JOINs, connections are managed as persistent pointers between records. You can traverse thousands of records in no time. Supports schema-less, schema-full and schema-mixed modes. Has a strong security profiling system based on user, roles and predicate security and supports SQL amongst the query languages. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    htop

    htop

    An interactive process viewer

    htop is a cross-platform interactive process viewer. htop allows scrolling the list of processes vertically and horizontally to see their full command lines and related information like memory and CPU consumption. Also, system-wide information, like load average or swap usage, is shown. The information displayed is configurable through a graphical setup and can be sorted and filtered interactively. Tasks related to processes (e.g. killing and renicing) can be done without entering their...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    pprof

    pprof

    pprof is a tool for visualization and analysis of profiling data

    pprof is a profiling visualization and analysis tool that ingests profiles in the profile.proto format and generates human-readable and graph-based reports. It supports multiple profile types (CPU, heap, allocations, contention, etc.) and can present data as text tables, call graphs (via Graphviz/dot), flame graphs, and interactive web UIs. The tool helps developers find hot paths, quantify resource usage, and compare profiles across runs to validate performance changes. It is widely used in Go but also has bindings and exporters for other ecosystems, and the repository includes a Go package for reading and writing profiles programmatically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    patat

    patat

    Terminal-based presentations using Pandoc

    patat (Presentations Atop The ANSI Terminal) is a small tool that allows you to show presentations using only an ANSI terminal. It does not require ncurses. Leverages the great Pandoc library to support many input formats including Literate Haskell. Supports smart slide splitting. Slides can be split up into multiple fragments. There is a live reload mode. Theming support including 24-bit RGB. Auto advancing with configurable delay. Optionally re-wrapping text to terminal width with proper...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Searchkick

    Searchkick

    Intelligent search made easy

    Searchkick brings powerful, production-ready search to Rails by mapping Active Record models into Elasticsearch with sensible defaults and easy customization. It supports language analyzers, stemming, synonyms, misspelling tolerance, and highlighting so search results feel natural to end users. Indexing is model-centric: you declare what fields to index, add computed fields, and trigger reindexing via callbacks or background jobs, with options for zero-downtime rolling reindexes. On the...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Orchid

    Orchid

    Rapid application development of back-office applications

    Orchid is a free Laravel package that abstracts standard business logic and allows code-driven rapid application development of back-office applications, admin/user panels, and dashboards. Focus on PHP development and don't lose time with HTML, CSS, or JavaScript. Build application logic, not admin panels. Try our quick start guide and kick-start your application's development. Prevent reinventing the wheel or forms. Orchid already supports many form elements "out of the box" and allows you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Sage

    Sage

    WordPress starter theme with a modern development workflow

    Write stylesheets with Sass, automatically check your JavaScript for errors, optimize images, enable synchronized browser testing, and more with our Webpack based setup. Sage comes with the latest version of Bootstrap. Replace Bootstrap with Bulma, Foundation, Tachyons, Tailwind, or a blank slate upon installation. You will have the option to define theme meta information (name, URI, description, version, author) and choose a CSS framework. Sage uses Laravel's Blade templating engine, and since the .blade.php files live in a publicly accessible directory on your webserver, we recommend preventing plain-text access to them. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Architecture as a code

    Architecture as a code

    Visualize, collaborate, and evolve the software architecture

    Architecture as a code is an open-source modeling language and toolkit that enables software teams to describe, visualize, collaborate on, and maintain software architecture as code. Inspired by the C4 Model and other architectural DSLs, LikeC4 lets you define your system’s structure in a textual DSL and then automatically generate consistent diagrams that reflect that design, ensuring that architecture documentation stays in sync with source code changes. The project includes command-line...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Obsidian Visual Skills Pack

    Obsidian Visual Skills Pack

    Generate Canvas, Excalidraw, and Mermaid diagrams from text

    LLM-TLDR is a Python-based tool designed to dramatically reduce the amount of code a large language model needs to read by extracting the essential structure and context from a codebase and presenting only the most relevant parts to the model. Traditional approaches often dump entire files into a model’s context, which quickly exceeds token limits; LLM-TLDR instead indexes project structure, traces dependencies, and summarizes code in a way that preserves semantic relevance while shrinking...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Conversational Health Agents (CHA)

    Conversational Health Agents (CHA)

    A Personalized LLM-powered Agent Frameworks

    CHA, or Conversational Health Agents, is an open-source framework designed to build intelligent healthcare assistants powered by large language models and external data sources. The system enables developers to create personalized AI agents that can interact with users through natural language while performing multi-step reasoning and task execution. It integrates orchestration capabilities that allow the agent to gather information from APIs, knowledge bases, and external services in order...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB