Showing 51 open source projects for "word processing"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    spaCy

    spaCy

    Industrial-strength Natural Language Processing (NLP)

    spaCy is a library built on the very latest research for advanced Natural Language Processing (NLP) in Python and Cython. Since its inception it was designed to be used for real world applications-- for building real products and gathering real insights. It comes with pretrained statistical models and word vectors, convolutional neural network models, easy deep learning integration and so much more. spaCy is the fastest syntactic parser in the world according to independent benchmarks, with an accuracy within 1% of the best available. ...
    Downloads: 99 This Week
    Last Update:
    See Project
  • 2
    gse

    gse

    Go efficient multilingual NLP and text segmentation

    Go efficient multilingual NLP and text segmentation; support English, Chinese, Japanese and others. Gse is implements jieba by golang, and try add NLP support and more feature. Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes. Support user and embed dictionary, Part-of-speech/POS tagging, analyze segment info, stop and trim words. Support multilingual: English, Chinese, Japanese and others. Support Traditional Chinese. Support HMM cut text...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    PHPWord

    PHPWord

    PHP library for reading and writing word processing documents

    PHPWord is a library written in pure PHP that provides a set of classes to write to and read from different document file formats. The current version of PHPWord supports Microsoft Office Open XML (OOXML or OpenXML), OASIS Open Document Format for Office Applications (OpenDocument or ODF), and Rich Text Format (RTF). PHPWord is an open source project licensed under the terms of LGPL version 3. PHPWord is aimed to be a high quality software product by incorporating continuous integration and...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 4
    txtai

    txtai

    Build AI-powered semantic search applications

    txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    unioffice

    unioffice

    Pure go library for creating and processing Office Word documents

    unioffice is a library for creation of Office Open XML documents (.docx, .xlsx and .pptx). Its goal is to be the most compatible and highest-performance Go library for the creation and editing of docx/xlsx/pptx files. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team. Our SDKs libraries are flexible and developed to solve common problems. We provide high-level...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    BookStack

    BookStack

    Simple & Free Wiki Software

    BookStack is a free and open source platform for storing and organising information and documentation. A self-hosted and opinionated wiki system, BookStack is simple and easy to use, giving even new users with just basic word-processing skills a pleasant out of the box experience. BookStack offers a relaxed, open and positive approach. While the platform can provide advanced power features to those who want them, it is primarily designed not to be extensible outside of its core purpose. That being said, BookStack already comes with plenty of powerful features, such as search and linking, cross-book sorting, image management and more. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    OmegaT - multiplatform CAT tool

    OmegaT - multiplatform CAT tool

    The free computer aided translation (CAT) tool for professionals

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.
    Leader badge
    Downloads: 1,498 This Week
    Last Update:
    See Project
  • 9
    Lingua

    Lingua

    The most accurate natural language detection library for Java

    Its task is simple: It tells you which language some provided textual data is written in. This is very useful as a preprocessing step for linguistic data in natural language processing applications such as text classification and spell checking. Other use cases, for instance, might include routing e-mails to the right geographically located customer service department, based on the e-mails' languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    CNote

    CNote

    A Fast Lightweight Note & Code Editor

    CNote is a fast and lightweight note/code editor, it can help you quickly create/edit notes or write code. it has all the important features you need in one place, With Live Html Preview, Python Code Execution, and more, you can find the changelog at the Github repo.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 11
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Word2ePub

    Add-in pour générer un ePub depuis Word (2007 et sup.)

    Update : J'ai eu beaucoup de difficultés avec l'installeur : actuellement, l'installeur fait une installation pour l'utilisateur actuel seulement. Les filtres chrome, Avast et Windows peuvent se déclencher car le fichier n'est pas souvent téléchargé... Word 2007: Attention, installer le hot fix de microsoft avant l'installation de word2epub :https://support.microsoft.com/en-us/kb/976811 Ce programme ajoute un onglet dans le ruban Word, pour permettre : 1. La génération d'ePub...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    jdw argv[]

    jdw argv[]

    argv processing support for PHP scripts

    'jdw_argv.php' is an argv processing support script for PHP. 'jdw_argv.php' will give you the ability to accept CLI argv option/value pairs as single 'char' or longer 'word' specifiers, and to be of a specified 'type', and can take the form: -option=value --option=value -option value --option value -o --option -ovalue (for char options only, with a single minus)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP), the goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc..
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Roth

    Roth

    Roth is a full-stack enterprise application development framework.

    ...(Roth is an Irish word pronounced 'row-h' which means 'wheel' -- as in "Why reinvent the wheel?" -- "To make it better!")
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    PyTorch Natural Language Processing

    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    ...Now you've setup your pipeline, you may want to ensure that some functions run deterministically. Wrap any code that's random, with fork_rng and you'll be good to go. Now that you've computed your vocabulary, you may want to make use of pre-trained word vectors to set your embeddings.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Win32Forth is ANS compatible, Forth language application development system with many tools; Interactive console, integrated extensible debugger, a GUI file editor, hypertext rendering, hyperlinked source files. VIEW <word-name> to explore the many files
    Leader badge
    Downloads: 58 This Week
    Last Update:
    See Project
  • 18
    SpeedEULA

    SpeedEULA

    Magyar szövegszerkesztő

    Sziasztok! Ez egy magyar szövegszerkesztő program lenne! PRO licenc kód: 74HVR-7ENS9-NDH73-HDM48 Hivatalos discord szerver: https://discord.gg/VUw6DkZ
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    buddidictionary

    buddidictionary

    An English to Sinhala Dictionary with Morphological Processing

    Buddidictionary is an English to Sinhala bilingual dictionary embed with English and Sinhala Morphological analysis. the system has been developed as a part of the EnSiMaS Project which is capable to translate English sentence into Sinhala. System has been developed through the MaSMT MUlti agent system development framework
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 21
    node2vec

    node2vec

    Learn continuous vector embeddings for nodes in a graph using biased R

    ...By bridging ideas from graph theory and word embedding models, this project demonstrates how graph-based machine learning can be made efficient and flexible.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22

    JCLTP

    A Java Class Library for Text Processing

    JCLTP is a class library designed for processing text. JCLTP is free, open source and developed with the Java programming language. JCLTP is distributed under the GNU license. It incorporates several technologies that enable process information while applying AI techniques, in order to build predictive models for text classification. Through a flexible structure of interfaces and classes, the opportunity to extend, adapt and add functionality JCLTP is provided. Thus, analysis of new types...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Notepad3

    Notepad3

    Light-weight Scintilla-based text editor with syntax highlighting

    Notepad3 is a fast and light-weight Scintilla-based text editor with syntax highlighting. Notepad3 is an excellent replacement for the default Windows text editor. Notepad3 offers many extra features over Notepad. It has a small memory footprint, but is powerful enough to handle most programming jobs.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 24
    Sindhi Word Editor

    Sindhi Word Editor

    Its an Unicode based Sindhi Editor for formating sindhi documents.

    Sindhi Word Editor is another application developed by Zeeshan Bhatti, that allows basic word processing and editing functionality. It's an easy to use simple editor that allows the user to enter Sindhi text directly into the editor, without the need of installing any regional Language, Fonts, or any external keyboard support.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Alida

    Alida

    An annotation based Java Validation Framework for Java Classes

    - Alida is a lightweight API for validating Java objects. -“Alida” comes from pruning the word “vALIDAtion”. It also means a “Winged creature”. It can give programmers “wings” by delegating the validation to another API. - Alida uses only annotations for ‘marking up’ the objects which need to be validated. - It is packaged as a jar file (alida.jar) which can be dropped in the classpath and used. It can also be configured with eclipse’s annotation processing to give complier errors and warnings at the time of usage...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB