Showing 112 open source projects for "python text parser"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    bilingual_book_maker

    bilingual_book_maker

    Make bilingual epub books Using AI translate

    bilingual_book_maker is an AI-assisted translation tool for creating bilingual and multilingual versions of books and text files. It is designed to process formats such as EPUB, TXT, SRT, and PDF, then generate translated output that helps readers compare the original text with the target language. The project supports multiple AI providers and models, including OpenAI-compatible models and other translation backends through LiteLLM-style integrations.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 4
    go1pylib

    go1pylib

    go1pylib is a Python library designed to control the Go1 robot

    go1pylib is a Python library designed to control the Go1 robot by Unitree Robotics. It provides an easy-to-use interface for robot movement, state management, collision avoidance, battery monitoring, and MQTT communication. Ideal for research and robotics development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    wttr.in

    wttr.in

    The right way to check the weather

    wttr.in is a console-oriented weather forecast service that supports various information representation methods like terminal-oriented ANSI-sequences for console HTTP clients (curl, httpie, or wget), HTML for web browsers, or PNG for graphical viewers. Originally started as a small project, a wrapper for wego, intended to demonstrate the power of the console-oriented services, wttr.in became a popular weather reporting service, handling tens of millions of queries daily.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Violin

    Violin

    Open-source Video Translation Skill

    Violin is an open-source video translation and dubbing tool that turns existing videos into localized versions with translated voice-over and optional subtitles. It transcribes the original speech, translates the text, generates natural-sounding speech in the target language, and remuxes the new audio back into the video. The project is designed to keep the generated speech aligned with the original timing so the final result feels closer to a real dubbed video. It can be used from the...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    MathCode

    MathCode

    A Frontier Mathematical Coding Agent

    MathCode is a terminal-based AI coding assistant focused on mathematical formalization and theorem proving. It is designed to transform plain-language mathematical reasoning into verified Lean 4 code and formal proofs. The project combines AI agents with Lean Language Server Protocol integration, allowing it to inspect compiler feedback, search for lemmas, and iteratively repair failed proof attempts. It supports an agentic proving workflow where the system behaves more like an interactive...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AlohaMini

    AlohaMini

    Open-Source Dual-Arm Mobile Robot with Motorized Lift

    AlohaMini is a lightweight self-hostable productivity and note-taking platform designed to help individuals organize thoughts, tasks, and knowledge in a fast, minimal interface that avoids unnecessary complexity. It combines hierarchical notebooks, rich text editing, and tagging, allowing users to structure information the way they think and refine that structure over time. The platform focuses on speed and responsiveness, with a UI that loads instantly and feels smooth even on modest...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Tokenized Text Aligner

    Aligns tokens in two versions of a text with differing tokenization.

    This tool performs token-by-token alignment of two versions of a text with differing tokenization by interpreting the results of a file diff (https://docs.python.org/3/library/difflib.html). It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization....
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    asammdf

    asammdf

    Fast Python reader and editor for ASAM MDF / MF4 (Measurement Format)

    *asammdf* is a fast Python parser and editor for ASAM (Associtation for Standardisation of Automation and Measuring Systems) MDF / MF4 (Measurement Data Format) files. It supports MDF versions 2 (.dat), 3 (.mdf) and 4 (.mf4). *asammdf* works on Python 2.7, and Python >= 3.4
    Leader badge
    Downloads: 46 This Week
    Last Update:
    See Project
  • 11
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Asymptote

    Asymptote

    2D & 3D TeX-Aware Vector Graphics Language

    Asymptote is a powerful descriptive vector graphics language for technical drawing, inspired by MetaPost but with an improved C++-like syntax. Asymptote provides for figures the same high-quality typesetting that LaTeX does for scientific text.
    Leader badge
    Downloads: 187 This Week
    Last Update:
    See Project
  • 13
    WordCount

    WordCount

    Count frequency of single, 2-word and 3-word clusters in a text

    The program can read a text file and count the occurrences of single words and clusters of 2 and 3 words. The resulting list will be sorted in descending order (highest frequency on top).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Graphical Forecast Editor

    Graphical Forecast Editor

    View and edit grids of meteorological values

    Allows viewing and editing of weather data from weather models as two dimensional grids. Python scripting support hooks in the ingest, editing and product creation subsystems allow for customization. All network communication made through HTTP services. Enhanced version of the system used by national weather services.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    CSVSplitter
    # CSV Splitter Uma ferramenta para dividir arquivos CSV em múltiplos arquivos com base na quantidade de registros especificada, mantendo a integridade dos dados e permitindo configurações de charset, separador e formatação. Ideal para lidar com grandes arquivos CSV que precisam ser fragmentados para melhor manuseio e processamento. ## Funcionalidades - **Divisão de CSV**: Divide o arquivo original em múltiplos arquivos CSV, com o número de registros por arquivo definido pelo...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    pyLogos

    Qualitative content analysis software.

    pyLogos is a program to support text content analysis. Documents (imported from txt and docx files) are stored in a database, and may have marked text segments associated with codes. It is possible to retrieve these segments in various ways, generate word clouds, tabulate frequency of codes and words, among other outputs. pyLogos é um programa de apoio à análise de conteúdo de textos. Documentos (importados de arquivos txt e docx) são armazenados numa base de dados, podendo ter segmentos...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    dockserver-talk

    Python module to interface with a Slocum glider dockserver

    ...Surfalarm: a program that can send text messages or make direct phone calls, acting upon a glider to surface and connect to a dockserver, being in a mission, or aborted. Serial2rudics: a helper program that lets a glider simulator connected to a serial port appear as a glider connecting and disconnection via rudics/iridium when at ther surface or underwater, respectively.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Color to Word

    Color to Word

    Turn colors into words

    The program will turn a color into a list of 10 words, obtained according to a custom designed algorithm based on letter shape and position in the alphabet. - Click inside the frame on the left to pick a color through the color chooser window - The program will match the color with the colors corresponding to a list of all the English words contained in the file wordcolor.txt - The first 10 matches will appear in the frame on the right - Right-click - Copy to copy the word...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    QUAST

    QUAST

    Quality Assessment Tool for Genome Assemblies

    QUAST performs fast and convenient quality evaluation and comparison of genome assemblies. It is maintained by the Gurevich lab at HIPS (https://helmholtz-hips.de/en/hmsb). For the most up-to-date description, please visit http://quast.sf.net. Below are just some highlights. QUAST computes several well-known metrics, including contig accuracy, the number of genes discovered, N50, and others, as well as introducing new ones, like NA50 (see details in the paper and manual). A...
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 20
    CAIRO for AERMOD

    CAIRO for AERMOD

    AERMOD, visualisation, input, modelling and compiling tool

    CAIRO for AERMOD v1.1 by MSc Dominik Subotić Simplified training software Avaliable: www.sourceforge.net/projects/cairo-for-aermod/ QGIS plugin: CAIROforAERMOD (Coming 2025.) Tutorial: https://www.youtube.com/watch?v=DZnsJuu1zLc AERMAP, AERMOD and AERPLOT analysis tool and input file compiler. Features: Automatic input by copying coordinates (Google Maps or text) and automatic conversion to UTM. Sources are automatically visualised in Google Earth. Input is done through user...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    CiteFlow

    CiteFlow

    Desktop research workspace for PDFs, notes, citations, bibliographies.

    CiteFlow is a focused desktop research workspace for students, researchers, and academic writers who want to manage PDFs, notes, citations, and bibliographies in one place. Create project-based workspaces for essays, articles, reports, literature reviews, and long-form research. Import PDFs, read them inside the app, search within documents, compare files side by side, highlight key passages, and add page-based notes. CiteFlow can assist with DOI metadata detection, keeps citation history...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Gem Measure

    Gem Measure

    Live gemstone symmetry & distortion measurement

    Gem Measure overlays an adjustable polygon onto a gemstone image or video feed and instantly calculates edge length deviations, symmetry, and overall quality. It's designed for jewelers, gemologists, and hobbyists who need quick, accurate measurements without expensive software. Works with USB microscopes, webcams, or any image file. How to Use Open an image (F) or connect a camera (C). Adjust the polygon by dragging its vertices to match the gem outline. Read deviation...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    Advanced Trigonometry Calculator

    Advanced Trigonometry Calculator

    Precision Trigonometry: Advanced Calculator for Complex Math

    Advanced Trigonometry Calculator is equipped with a user-friendly interface that allows for easy input of problems and instant computation. Professionals such as engineers who need to perform advanced trigonometric calculations in their work will find this tool extremely useful. ATC Online Alpha: https://advantrigoncalc.sourceforge.io/atc/ More info by clicking below: https://advantrigoncalc.sourceforge.io/ Advanced Trigonometry Calculator was only and always only developed by...
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 24

    PythonStarSplitter

    A Python Script I made to split a starfield image into several layers.

    A Python Script I made to split a starfield image into several layers. To be able to use the script, PixInsight with an installed Gaia data catalogue is required, as it needs the exported astrometry data text file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    MITRE Annotation Toolkit

    A toolkit for managing and manipulating text annotations

    The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g.,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB