Search Results for "text processing" - Page 22

Showing 1744 open source projects for "text processing"

  • 1
    Vrapper

    Vim-like editing in Eclipse

    Vrapper is an Eclipse plugin that acts as a wrapper for existing Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Eclipse Update Site: http://vrapper.sourceforge.net/update-site/stable
    Downloads: 13 This Week
  • 2
    QStripper
    Convert Sinclair QL word processing (Quill etc.) documents to PDF, HTML, text and DocBook XML.
    Downloads: 0 This Week
  • 3
    cnn-text-classification-tf

    Convolutional Neural Network for Text Classification in Tensorflow

    The cnn-text-classification-tf repository by Denny Britz is a well-known educational implementation of convolutional neural networks for text classification using TensorFlow, aimed at helping developers and researchers understand how CNNs can be applied to natural language processing tasks. Based loosely on Kim’s influential paper on CNNs for sentence classification, this codebase demonstrates how to preprocess text data, convert words into learned embeddings, and apply multiple convolution filters to extract n-gram features that are then pooled and fed into a classifier. ...
    Downloads: 0 This Week
  • 4
    Mavscript

    Calculations in a text document

    Mavscript allows the user to do calculations in a text document. Plain text, LaTeX and OpenOffice Writer files (.odt) are supported. The calculation is done by the algebra system Yacas (default), Jasymca or by the Java interpreter BeanShell.
    Downloads: 0 This Week
  • 5
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 202 This Week
  • 6
    TeXML is an XML vocabulary for TeX. The processor transforms the TeXML markup into the TeX markup, escaping special and out-of-encoding characters. The intended audience is developers who automatically generate [La]TeX or ConTeXt files.
    Downloads: 4 This Week
  • 7
    PDF To Text Watcher

    Profile-based watcher for automated processing of PDF files to text.

    Watches folders to automate transforming PDF files into text with optional metadata extraction. Requires the XPDF tools, which you must source separately. Lets you set up multiple profiles, modify profiles 'hot' without saving and move or delete the source PDFs after processing.
    Downloads: 0 This Week
  • 8
    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extensible in Java to display data from any data source (such as a database). Can generate complete homepages (trees of HTML files, images, etc.).
    Downloads: 108 This Week
  • 9
    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it returns a structured object with the extracted text and other fields. It supports caching internal representations to speed up repeated extractions. While its language support is best for English, it is still widely used in web-content-processing pipelines. ...
    Downloads: 0 This Week
  • 10
    lottie vectors

    Create, display and process 2D vectors in a 3D window.

    Lottie Vectors is an application for MATLAB that allows you to do some pretty neat things with vectors, more exactly, displaying them in ways that hopefully will allow you to explore and better understand your vector data. The basic idea is simple: take a vector defined in one of a few different types of data formats and map it on the screen. Add another vector and you start to form a 'route'. Each route or position vector can be accompanied by a 'force' vector. This can be used to...
    Downloads: 2 This Week
  • 11
    mzitu

    Python crawler that downloads image galleries and analyzes titles

    ...It focuses on automating the collection of large sets of images by programmatically parsing page content and iterating through gallery entries. mzitu also includes a simple analysis script that processes downloaded folder names to generate statistics and visualizations. Using text segmentation and frequency analysis, the project can create a word cloud representing common keywords found in the dataset. This makes the repository both a scraping example and a small data analysis experiment built around the collected content. Overall, mzitu serves as a learning-oriented implementation of Python web scraping, data processing, and visualization techniques.
    Downloads: 0 This Week
  • 12
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Downloads: 2 This Week
  • 13
    FireTeX: LaTeX Editor and Compiler

    Edit your LaTeX and TeX files

    FireTeX is a complete web-based LaTeX editor that is powerful, intuitive, and stocked with useful functions for exporting the results in three formats. It combines an editor with a LaTeX compiler, code highlighting, advanced search/replace, and the HTML5 filesystem API. Android app available on Play Store: https://play.google.com/store/apps/details?id=com.ulmdesign.ulmtex Update 30.06.2017: Windows 7 and later and macOS 10.9 and later are supported. Browser extensions: Add-on...
    Downloads: 0 This Week
  • 14
    XML - JMS/JSON

    Converter utility between XML text and Javascript object / JSON text

    Convert XML text to a JavaScript object / JSON text (and vice versa). Can parse elements, attributes, texts, comments, CDATA, DOCTYPE, XML declarations, and processing instructions. Usually, the output of XML attributes is stored in @attr, _attr, $attr, or $ to avoid conflicts with the names of sub-elements. This library stores them in attributes by default, and you can change this key to whatever you like.
    Downloads: 0 This Week
  • 15
    DSTK - Data Science TooKit 3

    Data and Text Mining Software for Everyone

    DSTK - Data Science Toolkit 3 is a set of data and text mining tools that follows the CRISP-DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, and modeling and evaluation for machine learning and algorithms. It is based on the old version of DSTK at https://sourceforge.net/projects/dstk2/ The DSTK Engine is like R.
    Downloads: 0 This Week
  • 16
    JSesh is an ancient Egyptian hieroglyphic text processor, currently used by professionals and amateurs alike. It runs on all platforms supporting Java (Mac, Windows, Linux). It can also be used as a library for your own software. SOURCES HAVE MOVED TO GITHUB: https://github.com/rosmord/jsesh
    Downloads: 10 This Week
  • 17
    DeepLearn

    Implementation of research papers on Deep Learning + NLP + CV in Python

    Welcome to DeepLearn. This repository contains implementations of the following research papers on NLP, CV, ML, and deep learning. The required dependencies are listed in requirement.txt. I will also use dl-text modules for preparing the datasets. If you haven't used it, please have a quick look at it. CV, transfer learning, representation learning.
    Downloads: 0 This Week
  • 18
    cebe/markdown

    A super fast, highly extensible markdown parser for PHP

    cebe/markdown is a super fast, highly extensible markdown parser for PHP. It is a set of PHP classes, each representing a Markdown flavor and a command line tool for converting Markdown files to HTML files. The implementation focus is to be fast (see benchmark) and extensible. You are able to add additional language elements by directly hooking into the parser, no (possibly error-prone) post- or pre-processing is needed to extend the language. It is also well-tested to provide the best...
    Downloads: 0 This Week
  • 19
    Clu-Linux-Live

    Various Processing and Data Rescue Tools over Wired or Wireless Networ

    This Linux Live CD provides various processing command-line utilities (Clu) and data rescue tools which can be used on a wired or wireless network. On startup it prompts the user to change the password, mount all filesystems available locally, start the wireless network (if a Wi-Fi interface is present), start network services (samba/ssh/sftp) and presents the user with a console for executing various utilities (e.g. text, image, audio, video, downloading, etc.) on the mounted filesystems. ...
    Downloads: 7 This Week
  • 20
    JUSH is a syntax highlighting component written in JavaScript. It highlights HTML, CSS, JS, PHP and SQL code embedded into each other. Besides syntax highlighting, it provides links to the documentation for all supported languages.
    Downloads: 0 This Week
  • 21
    Deeplearning-papernotes

    Summaries and notes on Deep Learning research papers

    Deeplearning-papernotes is an implementation of Convolutional Neural Networks for sentence and text classification in TensorFlow, based on a well-known research paper that applies CNN architectures to natural language processing tasks with strong performance in sentiment analysis and similar classification problems. The repository provides the complete network definition, including an embedding layer to convert words into dense representations, convolution and max-pooling layers to extract informative features, and a final softmax classifier to distinguish between target classes. ...
    Downloads: 0 This Week
  • 22
    IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.
    Downloads: 0 This Week
  • 23
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techniques. A command-line executable is shipped that allows sorting documents by codepage.
    Downloads: 7 This Week
  • 24

    Indexmeister

    Automatic indexing for large LaTeX documents

    Indexmeister reads a variety of formats (.tex, .docx, .epub, and others) and suggests keywords for indexing. The included program Imbrowse provides a semi-automatic interface to rapidly add index tags to multi-file LaTeX documents.
    Downloads: 0 This Week
  • 25

    JETT

    JETT - Java Excel Template Translator

    JETT is a Java 5.0 API that reads an Excel spreadsheet as a template, takes your data, and creates a new Excel spreadsheet that contains your data, formatted as in the template. It works with .xls and .xlsx template spreadsheets.
    Downloads: 1 This Week