64 projects for "python text parser" with 2 filters applied:

  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 1
    Papers We Love

    Papers We Love

    Papers from the computer science community to read and discuss

    Papers We Love (PWL) is a global open source community dedicated to reading, discussing, and sharing influential computer science research papers. The repository serves as a curated directory of academic papers that have shaped the field of computing, providing a centralized location for documents that were previously scattered across various online sources. While licensing restrictions prevent hosting all papers directly, PWL offers links to their original sources and clearly marks hosted...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    go1pylib

    go1pylib

    go1pylib is a Python library designed to control the Go1 robot

    go1pylib is a Python library designed to control the Go1 robot by Unitree Robotics. It provides an easy-to-use interface for robot movement, state management, collision avoidance, battery monitoring, and MQTT communication. Ideal for research and robotics development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    Tokenized Text Aligner

    Aligns tokens in two versions of a text with differing tokenization.

    This tool performs token-by-token alignment of two versions of a text with differing tokenization by interpreting the results of a file diff (https://docs.python.org/3/library/difflib.html). It is intended for use in the preparation of annotated linguistic corpora, where differences in tokenization may arise (i) following corrections or modifications to the source text or (ii) through the creation of different layers of annotation (part-of-speech, treebank) requiring different tokenization...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Asymptote

    Asymptote

    2D & 3D TeX-Aware Vector Graphics Language

    Asymptote is a powerful descriptive vector graphics language for technical drawing, inspired by MetaPost but with an improved C++-like syntax. Asymptote provides for figures the same high-quality typesetting that LaTeX does for scientific text.
    Leader badge
    Downloads: 194 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Libro

    Libro

    An interactive program for statistical analysis of texts

    A cross-platform text analysis program written in Python and Free Pascal/Lazarus which scans a whole text file (in plain text, HTML, EPUB, or ODT formats) and ranks all used words according to frequency, performing a quantitative analysis of the text using Shannon-Weaver information statistic and Zipf power law function. It counts words, sentences, chars, spaces, and syllables. Also computes readability indexes (Gunning-Fog, Coleman-Liau, Automated Readability Index (ARI), SMOG grade, Flesch...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    DataMelt

    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This Java multiplatform program is integrated with several scripting languages such as Jython (Python), Groovy, JRuby, BeanShell. DMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. Linear, non-linear...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    The FMI++ Library

    The FMI++ Library

    A High-level Utility Package for FMI-based Software Development

    The Functional Mock-up Interface (FMI) specification intentionally provides only the most essential and fundamental functionalities in the form of a C interface. On the one hand, this increases flexibility in use and portability to virtually any platform. On the other hand, such a low-level approach implies several prerequisites a simulation tool has to fulfill in order to be able to utilize such an FMI component. The FMI++ library addresses this problem for models and tools with...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    OpenNum

    OpenNum

    OpenNum lets you distribute solvers with a nice graphical interface

    Typically, to program a GUI is time consuming and requires experience with graphic libraries. OpenNum lets you create a graphical interface adapted to your solvers by simply editing an XML configuration file. More specifically, OpenNum lets you · to collect a hierarchical dataset, · to call any executable file and · to visualize scalar and vector fields, plot graphs or show simple plain text files. It also has other useful utilities specifically designed for numerical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    A tool for managing bibliographic databases (BibTeX, Refer,...) through an user friendly interface.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • Photo and Video Editing APIs and SDKs Icon
    Photo and Video Editing APIs and SDKs

    Trusted by 150 million+ creators and businesses globally

    Unlock Picsart's full editing suite by embedding our Editor SDK directly into your platform. Offer your users the power of a full design suite without leaving your site.
    Learn More
  • 10
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 12
    TEACUP

    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    chemfiles

    chemfiles

    Modern library for chemistry file reading and writing

    Chemfiles is a modern and high-quality library for reading and writing trajectory files created by computational chemistry simulations program. These trajectories contains atomic positions, velocities, names, topology and sometimes more. Running simulations produce enormous amounts of data, which has to be post-processed to extract physical information about the simulated system. Chemfiles provides an interface to access this information which is - unified: the same code will work with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Buildes

    Buildes

    A designer’s program for describing parts of the building

    Buildes is an integrated development environment that assists the user in creating building information. It reads a text (session) file from which it compiles the information. It then allows the user to browse, analyze and export the resulting building knowledge. The compilation system is written in pure Python. The building components created are rendered in PythonOCC. The GUI is written using PyQt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    orbkit (Moved to Github)

    orbkit (Moved to Github)

    A Modular Python Toolbox for Cross-Platform Post-Processing of Quantum

    PLEASE NOTE ORBKIT HAS BEEN MOVED TO https://github.com/orbkit/orbkit orbkit is a parallel Python program package for post-processing wave function data extracted from output files of MOLPRO (Molden File Format), TURBOMOLE (AOMix file format), GAMESS-US, PROAIMS/AIMPAC (wfn/wfx file format), and Gaussian (Output File and Formatted Checkpoint File) output files. Futhermore, an interface to cclib, a parser for quantum chemical logfiles, is provided. If you use orbkit in your work, please...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    bnf2xml

    simple BNF parser makes xml markup of matches

    bnf2xml a simple BNF parser that takes text as input, searches according to a BNF query file, and outputs text marked up by the xml labels that show context. bnf2xml is as simple to use as any text binary ie, awk(1) grep(1). bnf2xml does not require C API because it outputs simple xml labeling. README is visible on file dl page. EXAMPLE: $ echo "hi" | bnf2xml patternfile <word><alph>h</alph><alph>i</alph></word> or <gas>hydrogen iodide</gas> patternfile says how to find...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PortOpt

    PortOpt

    A portfolio-optimizer using Markowitz(1952) mean-variance model

    PortOpt [Portfolio Optimizer] is a C++ program (with Python binding) implementing the Markowitz(1952) mean-variance model with agent's linear indifference curves toward risk in order to find the optimal assets portfolio under risk. You have to provide PortOpt (in text files or - if you use the api - using your own code) the variance/covariance matrix of the assets, their average returns and the agent risk preference. It returns the vector of assets' shares that composes the optimal...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    ... to virtually any text collection, language, and MWE type. It is a command-line tool written mostly in Python. Its development started in 2010 as a PhD thesis but the project keeps active (see the SVN logs). Up-to-date documentation and details about the tool can be found on the mwetoolkit website: http://mwetoolkit.sourceforge.net/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Qfsm

    Qfsm

    A graphical Finite State Machine (FSM) designer.

    A graphical tool for designing finite state machines and exporting them to Hardware Description Languages, such as VHDL, AHDL, Verilog, or Ragel/SMC files for C, C++, Objective-C, Java, Python, PHP, Perl, Lua code generation.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 22
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Downloads: 69 This Week
    Last Update:
    See Project
  • 23

    quichem

    Utilities for fast input of chemical formulae and equations.

    *** New releases of quichem will only be available on GitHub. *** quichem is a utility written in pure Python allowing for fast input and parsing of chemical formulae and equations. Entering data does not require any modifier keys or keystrokes to be pressed. This SourceForge page provided pre-built Windows binaries of quichem-pyside. **To run quichem on Mac OS X or Linux**, or with the wxPython front-end, please download the source from quichem's GitHub page at: github.com/spamalot...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    With this tool you can easily design a new language, digitize an existing one or incrementally reconstruct an ancient language. It allows for free experimentation of all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in xml format for easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Easy Equations

    Easy Equations

    Hand Written Equation Creator

    Easy Equations is a utility using which one can write the mathematical equations. The focus of this utility is to provide user friendly access to write mathematical equations which is helpful for students, lecturers, mathematicians and Research persons who prefer using mathematical equations in a document, PowerPoint or web sites. Works on Windows as well as Linux platforms. Software Requirements: JDK 7 or higher. Linux Platform with kernel version 2.7 or higher.(for Linux users).python...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.