Showing 63 open source projects for "text tools"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    q - Text as Data

    q - Text as Data

    Run SQL directly on CSV or TSV files

    q is a command line tool that allows direct execution of SQL-like queries on CSVs/TSVs (and any other tabular text files). q treats ordinary files as database tables, and supports all SQL constructs, such as WHERE, GROUP BY, JOINs etc. It supports automatic column name and column type detection, and provides full support for multiple encodings. q fully supports all types of encoding. Use -e data-encoding to set the input data encoding, -Q query-encoding to set the query encoding, and use -E...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    Big Sleep

    Big Sleep

    A simple command line tool for text to image generation

    A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Ryan Murdock has done it again, combining OpenAI's CLIP and the generator from a BigGAN! This repository wraps up his work so it is easily accessible to anyone who owns a GPU. You will be able to have the GAN dream-up images using natural language with a one-line command in the terminal. User-made notebook with bug fixes and added features, like google drive integration. Images will be saved to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    ...A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). Trainer provides a variety of built-in Callback functions to facilitate experiment recording, exception capture, etc. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    srt2vobsub

    A command-line tool that converts text subtitles to vobsub

    A command line tool for Linux and other Unix-like systems that generates a pair of .idx/.sub vobsub subtitles files from textual subtitles.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    PyTorch Book

    PyTorch Book

    PyTorch tutorials and fun projects including neural talk

    This is the corresponding code for the book "The Deep Learning Framework PyTorch: Getting Started and Practical", but it can also be used as a standalone PyTorch Getting Started Guide and Tutorial. The current version of the code is based on pytorch 1.0.1, if you want to use an older version please git checkout v0.4or git checkout v0.3. Legacy code has better python2/python3 compatibility, CPU/GPU compatibility test. The new version of the code has not been fully tested, it has been tested...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A Complete Beginner's Guide to Django

    A Complete Beginner's Guide to Django

    A Complete Beginner's Guide to Django - Code Samples

    ...I wanted to do something different. A tutorial that would be easy to follow, informative and fun to read. That was when I came up with the idea to create some comics along the text to illustrate some concepts and scenarios. I hope you enjoy the reading! Django is a Web framework written in Python. A Web framework is a software that supports the development of dynamic Web sites, applications, and services. It provides a set of tools and functionalities that solves many common problems associated with Web development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    itchat

    itchat

    A complete and graceful API for Wechat

    itchat is an open-source WeChat personal account interface, it has never been easier to use python to call WeChat. With less than thirty lines of code, you can have a WeChat bot that handles all the information. Of course, the use of this api is far more than a robot, and more functions are waiting for you to discover. This interface shares a similar operation method with the public account interface itchatmp , learning to master two tools at a time. Now WeChat has become a large part of...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9
    Crazy Eddies GUI System (CEGUI)

    Crazy Eddies GUI System (CEGUI)

    A fast, powerful and adaptable GUI solution

    Crazy Eddie's GUI (CEGUI) system is a graphical user interface C++ library. It was designed particularly for the needs of videogames, but the library is usable for non-game tasks, such as any other type of applications (rendering/visualisation/virtual reality) and tools. It is designed for user flexibility in look-and-feel, as well as being adaptable to the user's choice in tools and operating systems. Established in 2003, CEGUI sees continual, active development and remains one of the...
    Downloads: 47 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    PROJECT MOVED TO https://github.com/paulhtremblay/rtf2xml The script rtf2xml faithfully converts Microsoft's RTF format to structured XML. Developers can make further transformations using standard XML tools, or use the stylsheets provided to convert to sdocbook or TEI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Klonwerk

    advanced variability management tool

    Klonwerk is an advanced and easy-to-use tool for reactive variability management and large-scale software reuse. It allows you to automatically clone and modify directories of textural or binary files (e.g. code, architecture, documentation), based on conceptual feature/decision models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Pootle, Virtaal & Translate Toolkit

    Pootle, Virtaal & Translate Toolkit

    Localization tools built by localizers for localizers

    Tools for localization: - Pootle: web based translation management system. - Virtaal: Computer Aided Translation (CAT) tool. - Translate Toolkit: QA, format conversion and support (PO, Java .properties, OpenOffice, Mozilla, XLIFF, TMX, TBX, CSV, Qt .ts).
    Leader badge
    Downloads: 70 This Week
    Last Update:
    See Project
  • 13

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/ The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expresisons, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    COAR-DMS

    COAR-DMS

    DMS for linux, C++ library, server, webUI , SOAP

    COAR-DMS is document management system for 32/64 bit. linux. Acts as library, server and tools. Library features: - storage management, free pages recycling - transaction log - indexing: full text, tags, metadata, document attributes - inverted index - versioning, collaboration - document trees, trees versionning - folders - plugins for auth (PAM,LDAP), db, file types plugins - tags - metadata (key value pairs) - object level security, folders documents ACL, - unix like security (rwx), special authorities - from thousands to tens of billions of documents - dashboard (working copies, new documents) - electronic signs - search statement, syntax like SQL - multithreaded, multiprocess library, Servers: - native HTTP server (libmicrohttp) - SOAP server - WebDAV(planed) - Indexer Python API WebUI GWT, JSP, SOAP-API
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Python Tkinter Extensions

    Python Tkinter Extensions

    Additional Graphical Classes and Widgets for Tkinter/Ttk

    This is a Python package with extensions for Tkinter. It supports both Python 2.7 and Python 3.3. Currently, the project is in version 1.0 of the Production stage. Feel free to fork off the main project or send me possible additions to the package.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Voikko

    Voikko

    Library of linguistic tools

    Voikko is a spell checking, grammar checking, morphological analysis and hyphenation system. Spell checkers are available for multiple languages, other features for Finnish only.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Autotoolset is both a colection of small tools (scripts, emacs macros, etc.) to simplify project development with autoconf/automake/libtool/texinfo/emacs and a good documentation on how to get started on these tools and the whole GNU software model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Dvipdfm tool for SCons

    SCons tool to cooperate with dvipdfm program

    SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built in compilation cache support. SCons rule sets are Python scripts so as well as the features it provides itself SCons allows you to use the full power of Python to control compilation. This is a SCons extension (tool) which enables usage of the dvipdfm program to convert dvi files to pdf.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    The Gedit "builder" plugin is a plugin for the Gnome Text Editor that allows you to compile source code and then click on the compiler output errors to go to those errors in the source code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    python dagger

    File dependency graph evaluator in pure python.

    Dagger evaluates file dependencies in a directed-acyclic-graph (DAG) like GNU make, but timestamps or hashes can be used on a per-file basis when evaluating which file nodes are up to date or stale. Use fast timestamp comparisons on large files when hashing is too slow, and hashing on small files. When hashing is used, it's stored in either a text file or sqlite database (in-memory at runtime is optional). Files can be forced as stale or uptodate. Dagger can be used as a building block for a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Hylt is a file format that provides HYperLinked Text with minimal markup, meant to be both easy to view in any program and easy to keep in a version control system. The Hylt project covers the file specification, viewers, and tools to manage Hylt files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    qenqote

    Clipboard Quote Encoder

    qenqote is a Python/Gtk program to operate on snippets of text from the clipboard and put results back on the clipboard. This allows automation of common editing operations. In its prototypical application, qenqote accepts URL's that have been copied or cut to the clipboard from a browser bar. qenqote applies a user-selected quoting function, and writes the result back to the clipboard. Besides its URL transformation applications, qenqote can be given a variable name from...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This is a simple tokenizer for converting source code in ascii text files into a ZX Spectrum loadable image file
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A collection of open source libraries and tools that provide solutions for common problems in processing Arabic text, especially in web applications. text normalization, phrase segmentation, text indexing, stop word lists, common spelling mistakes.
    Downloads: 3 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB