Search Results for "python text parser" - Page 25

Showing 1622 open source projects for "python text parser"

  • 1
    The CSS Parser is implemented as a package of Java classes that takes Cascading Style Sheets source text as input and outputs a Document Object Model Level 2 Style tree. Alternatively, applications can use SAC: The Simple API for CSS. Its purpose is to allow developers working with Java to incorporate Cascading Style Sheet information, primarily in conjunction with XML application developments.
    Downloads: 2 This Week
  • 2
    Automatic text summarizer

    Module for automatic summarization of text documents and HTML pages

    Sumy is an automatic text summarization library that provides multiple algorithms for extracting key content from documents and articles. Simple library and command line utility for extracting summary from HTML pages or plain texts. The package also contains a simple evaluation framework for text summaries. Implemented summarization methods are described in the documentation. I also maintain a list of alternative implementations of the summarizers in various programming languages.
    Downloads: 1 This Week
  • 3
    Point-E

    Point cloud diffusion for 3D model synthesis

    point-e is the official repository for Point-E, a generative model developed by OpenAI that produces 3D point clouds from textual (or image) prompts. Its principal advantage is speed: it can generate 3D assets in just 1–2 minutes on a single GPU, which is significantly faster than many competing text-to-3D models. The model works via a two-stage diffusion approach: first, it uses a text → image diffusion network to produce a synthetic 2D view consistent with the prompt; then a second...
    Downloads: 1 This Week
  • 4
    Minimal text diffusion

    A minimal implementation of diffusion models for text generation

    A minimal implementation of diffusion models of text: it learns a diffusion model of a given text corpus, allowing you to generate text samples from the learned model. The main idea was to retain just enough code to allow training a simple diffusion model and generating samples, remove image-related terms, and make it easier to use. To train a model, run scripts/train.sh. By default, this will train a model on the simple corpus. However, you can change this to any text file using the --train_data...
    Downloads: 0 This Week
  • 5
    CPT

    CPT: A Pre-Trained Unbalanced Transformer

    A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation. We replace the old BERT vocabulary with a larger one of size 51271 built from the training data, in which we 1) add 6800+ missing Chinese characters (most of them traditional Chinese characters); 2) remove redundant tokens (e.g. Chinese character tokens with the ## prefix); 3) add some English tokens to reduce OOV. Position embeddings: we extend the max_position_embeddings from 512 to 1024. We...
    Downloads: 2 This Week
  • 6
    SPyQL

    Query data on the command line with SQL-like SELECTs powered by Python

    SQL with Python in the middle. SPyQL is a query language that combines the simplicity and structure of SQL with the power and readability of Python. SPyQL offers a command-line interface that allows running SPyQL queries on top of text data (e.g. CSV, JSON). Data can come from files but also from data streams, such as Kafka, or from databases such as PostgreSQL.
    Downloads: 0 This Week
  • 7
    Hydrogen

    Run code interactively, inspect data, and plot

    Hydrogen is an interactive coding environment that supports Python, R, JavaScript and other Jupyter kernels. Hydrogen was inspired by Bret Victor's ideas about the power of instantaneous feedback and the design of Light Table. Running code inline and in real-time is a more natural way to develop. By bringing the interactive style of Light Table to the rock-solid usability of Atom, Hydrogen makes it easy to write code the way you want to. You also may be interested in our latest project –...
    Downloads: 0 This Week
  • 8
    Auto-Typer

    Hooks your keys to only type the thing you selected

    Hooks your keys to only type the thing you selected. Run the command line script. Enter the number of times to repeat the text. Enter whether to reverse the text or not. Enter what characters the text should contain (Enter for everything). Enter what characters the text should begin with (Enter for everything). Press Windows Shift S and select the area of the text. Press Enter to detect the text. Wait until the program displays the text. Press Enter to start typing. Then start typing, the...
    Downloads: 5 This Week
  • 9
    Karlo

    Text-conditional image generation model based on OpenAI's unCLIP

    Karlo is a text-conditional image generation model based on OpenAI's unCLIP architecture, with an improvement over the standard super-resolution model: it upscales from 64px to 256px, recovering high-frequency details in only a small number of denoising steps. We train all components from scratch on 115M image-text pairs including COYO-100M, CC3M, and CC12M. In the case of Prior and Decoder, we use ViT-L/14 provided by OpenAI’s CLIP repository. Unlike the original implementation of unCLIP, we...
    Downloads: 0 This Week
  • 10
    Repeat

    Mouse/keyboard record/replay and automation hotkeys/macros creation

    Full-fledged mouse/keyboard record/replay and automation hotkeys/macros creation using modern programming languages, plus more advanced automation features. Works across three major OSes: Windows, OSX, and Linux. Repeat yourself with some intelligence. This, if used correctly, can greatly improve your productivity. See more at https://github.com/repeats/Repeat
    Downloads: 67 This Week
  • 11
    truckLog

    Create and print a log for delivery stops.

    New version for Python 3. This program requires a web server (such as Apache) and Python 3.x. TruckLog generates a printable log that lists, line by line, the delivery stops to be made for any one truck load, with space provided for signatures. Any number of logs can be made for each day. The lines can be deleted or moved singly or in groups.
    Downloads: 0 This Week
  • 12
    Word segmentation utility for Thai language written in C
    Downloads: 0 This Week
  • 13
    d2l-zh

    Chinese-language edition of Dive into Deep Learning

    d2l‑zh is the Chinese-language edition of Dive into Deep Learning, an interactive, open‑source deep learning textbook that combines code, math, and explanatory text. It features runnable Jupyter notebooks compatible with multiple frameworks (e.g., PyTorch, MXNet, TensorFlow), comprehensive theoretical analysis, and exercises. It is widely adopted in over 70 countries and used by more than 500 universities for teaching deep learning.
    Downloads: 0 This Week
  • 14
    AFK Apex Bot

    A program that allows the user to AFK in Apex Legends games

    A program that allows the user to AFK in Apex Legends games in order to either lower their KD or get XP. This is especially useful for finishing the battlepass or getting an heirloom, as it can earn a substantial amount of XP in a short period of time. Only works in 1080P and 1440P (your primary monitor for Apex has to be one of these resolutions; check quick fixes at the bottom for a workaround). Also, it is assumed that your jump key is space and your move forward key is w but allows you to...
    Downloads: 0 This Week
  • 15
    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png as a default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js. Pdf text is converted to HTML. This can be used as a (transparent) layer over the image to enable text selection. ...
    Downloads: 0 This Week
  • 16
    stml

    Indentation Procedure in HTML

    Functionalities of STML: indentation procedure in HTML; better implementation for Python coders and others; reduced usage of closing tags.
    Downloads: 0 This Week
  • 17

    Work Case Toolkit 0.4 beta 1

    Support variant based Java program execution, adapted to support MDE.

    Work Case Toolkit is a tool that provides variant-based program execution. Our main goal is to define an API that can be embedded into any Java application to support dynamic execution based on variant flows. The variants are controlled by a decision graph using feature diagram relationships. The project was originally planned in 2005 to support the OMG Model Driven Architecture approach, focusing especially on the management of architectural changes and model transformers. The main...
    Downloads: 0 This Week
  • 18
    rust.vim

    Vim configuration for Rust

    rust.vim is the official Vim plugin from the Rust Lang Foundation team that adds support for Rust files in Vim and Neovim. It provides file-detection (so .rs files are recognised), syntax highlighting tuned for Rust’s syntax and macros, and supports formatting via rustfmt and other language-specific tooling. The plugin also integrates with other Vim tools and linters, helping you build a full Rust-editing workflow in Vim. Because Rust’s macro system and language features are more complex...
    Downloads: 0 This Week
  • 19
    brTxter

    Powerful and free source code editor for creative developers.

    From creative, for creative. Name: Txter. Verifying name / forum name: brTxter. Stable version: 0.8.0. Fresh version: 0.8.0. State: release candidate 1. Supported platforms: Windows 7 or higher; macOS High Sierra or higher (on both Intel and Apple Silicon chips).
    Downloads: 0 This Week
  • 20
    crossplane

    Quick and reliable way to convert NGINX configurations into JSON

    Reliable and fast NGINX configuration file parser and builder. Since crossplane is usually used to create payloads that are sent to different servers, it's important to keep security in mind. For that reason, the --ignore option was added. It can be used to keep certain sensitive directives out of the payload output entirely.
    Downloads: 0 This Week
  • 21
    HASS Configurator

    Configuration UI for Home Assistant

    ...The configurator fetches JavaScript libraries, CSS and fonts from CDNs. Hence it does NOT work when your client device is offline. And it is only available for Python 3.
    Downloads: 0 This Week
  • 22
    Web Spider, Web Crawler, Email Extractor

    Free tool that extracts emails, phones and custom text from the web using Java regex

    ...Extracts information from the web by parsing millions of pages. Data is stored in a Derby or MySQL database and is not lost if the spider is force-closed. Features: free web spider, parser, extractor, and crawler; extraction of emails, phones, and custom text from the web; export to Excel file; data saved into a Derby database; written in Java, cross-platform. See also Free Email Sender at this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application: https://www.microsoft.com/openjdk
    Downloads: 2 This Week
  • 23
    Email-IP

    Email and save on a cloud folder your IP for remote access

    The program checks the public Internet IP of the machine where it runs and emails it to an address of your choice. It can also save to a cloud folder a text file containing a log with the current IP and a timestamp of online/offline status, so you can read the IP by accessing your cloud. Useful for remotely accessing an FTP server, a web server or another service hosted on your computer.
    Downloads: 1 This Week
  • 24
    DiffSinger

    Singing Voice Synthesis via Shallow Diffusion Mechanism

    DiffSinger is an open-source PyTorch implementation of a diffusion-based acoustic model for singing-voice synthesis (SVS) and also text-to-speech (TTS) in a related variant. The core idea is to view generation of a sung voice (mel-spectrogram) as a diffusion process: starting from noise, the model iteratively “denoises” while being conditioned on a music score (lyrics, pitch, musical timing). This avoids some of the typical problems of prior SVS models — like over-smoothing or unstable GAN...
    Downloads: 18 This Week
  • 25

    OpenOffice.org Utility Library

    Library modules for creating ODF documents.

    OpenOffice.org Utility Library modules for creating Open Document Format (ODF) documents which can be read by Office Suites including OpenOffice.org, LibreOffice.org, and Microsoft Office. Currently, ooolib-python can create Calc spreadsheet ODS documents. These documents support many features, including: multiple table spreadsheets; cells with text, numbers, dates, and formulas; built-in styles; automatic styles (i.e., bold, italics, underline, font size, font color, background color, etc.); and column and row attributes, including width and height. I am currently hosting the new development code at the following URL: https://github.com/josephcolton/ooolib-python I will be posting releases here on SourceForge as well as GitHub, so you can come to either place for the releases. ...
    Downloads: 0 This Week