Showing 92 open source projects for "python text"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    A tool for managing bibliographic databases (BibTeX, Refer,...) through an user friendly interface.
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • 2

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CMD Plot Tool

    CMD Plot Tool

    Calculates and plots Colour Magnitude Diagrams from Astronomical data

    CMD Plot Tool calculates and plots Colour Magnitude Diagrams (CMDs) from astronomical photometric data, e.g. of a star cluster observed in two filter bandpasses. It can handle multiple file formats (plain text, DAOPHOT .mag files, ACS Survey of Galactic Globular Clusters .zpt files) to generate professional and customised plots, without the usual steep learning curve. Development using Python, OOP and a formal software development lifecycle model (Agile) allowed for the creation of an application that can be deployed on multiple systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic...
    Leader badge
    Downloads: 196 This Week
    Last Update:
    See Project
  • 6

    dadosSemiotica

    Collecter and manager of semiotica annalisis data

    This program is a web application to collect and organize data of text analysis. It works with sets of texts and the analysis are done on portions of the length of a sentence. One of the preprocessing modules is based on CoGroo (A LibreOffice & OpenOffice.org Portuguese Grammar Checker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    aioulinux

    aioulinux

    Linux for Arduino and Makers developers

    Hello, I'm the Aioulinux founder, eager to professionally revive the project. Since 2018, the demand for an IoT and Arduino-tailored environment has been evident. Seeking partners for a 2024 version targeting schools and IoT companies, aiming for a secure and comprehensive platform. If you share this vision and wish to collaborate, reach out. Let's revive Aioulinux stronger than ever! Now seeking partners: Live Distro Specialist: Expert in live distributions to ensure...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    TEACUP

    TEACUP

    TCP Experiment Automation Controlled Using Python

    TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    chemfiles

    chemfiles

    Modern library for chemistry file reading and writing

    Chemfiles is a modern and high-quality library for reading and writing trajectory files created by computational chemistry simulations program. These trajectories contains atomic positions, velocities, names, topology and sometimes more. Running simulations produce enormous amounts of data, which has to be post-processed to extract physical information about the simulated system. Chemfiles provides an interface to access this information which is - unified: the same code will work with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    DimensionalAnalysis

    DimensionalAnalysis

    Performs dimensional analysis from cells in a spreadsheet.

    ...Make sure the names match the contents of the name cells. Use the names in the formulas, not the cell addresses. To use the Dimension Analysis tool, select and copy the cells, then paste them into the tool’s input text box. Requires Python 3 plus SymPy and mpmath packages. http://www.sympy.org http://mpmath.org/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    TEES

    Turku Event Extraction System

    Turku Event Extraction System (TEES) is a free and open source natural language processing system developed for the extraction of events and relations from biomedical text. It is written mostly in Python, and should work in generic Unix/Linux environments. Currently, the TEES source code repository still remains on GitHub at http://jbjorne.github.com/TEES/ where there is also a wiki with more information.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are...
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 13
    Buildes

    Buildes

    A designer’s program for describing parts of the building

    Buildes is an integrated development environment that assists the user in creating building information. It reads a text (session) file from which it compiles the information. It then allows the user to browse, analyze and export the resulting building knowledge. The compilation system is written in pure Python. The building components created are rendered in PythonOCC. The GUI is written using PyQt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Part-of-speech tagging is the task of assigning symbols from a particular set to words in a natural language text. ACOPOST implements and extends well-known machine learning techniques and provides a uniform environment for testing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PortOpt

    PortOpt

    A portfolio-optimizer using Markowitz(1952) mean-variance model

    PortOpt [Portfolio Optimizer] is a C++ program (with Python binding) implementing the Markowitz(1952) mean-variance model with agent's linear indifference curves toward risk in order to find the optimal assets portfolio under risk. You have to provide PortOpt (in text files or - if you use the api - using your own code) the variance/covariance matrix of the assets, their average returns and the agent risk preference.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/ The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expresisons, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be applied to virtually any text collection, language, and MWE type. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Qfsm

    Qfsm

    A graphical Finite State Machine (FSM) designer.

    A graphical tool for designing finite state machines and exporting them to Hardware Description Languages, such as VHDL, AHDL, Verilog, or Ragel/SMC files for C, C++, Objective-C, Java, Python, PHP, Perl, Lua code generation.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 18

    DEAPathways

    Differential Expression Analysis for Pathways

    This project contains the source code associated with the PLoS Computational Biology publication: "Differential Expression Analysis for Pathways". The paper text can be found here: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002967
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once. To allow a flexible and comprehensive analysis of smallRNA-Seq data we designed a novel modular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 21
    SAMoCAD

    SAMoCAD

    Simple CAD program

    SAMoCAD - its program to create simple drawings, partially supports DXF. Main features: - create simple objects (line, arc, circle) - create complex objects (text, dimensions) - edit drawn objects - save drawing to SVG format - output drawing in PostScript format - export/import content DXF files (primitives LINE, CIRCLE, ARC, TEXT, DIMENSION) Program written in Python 2 and use the library Tkinter. Program requires no installation, but you need install Python 2.7 or later on your PC. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    quichem

    Utilities for fast input of chemical formulae and equations.

    *** New releases of quichem will only be available on GitHub. *** quichem is a utility written in pure Python allowing for fast input and parsing of chemical formulae and equations. Entering data does not require any modifier keys or keystrokes to be pressed. This SourceForge page provided pre-built Windows binaries of quichem-pyside. **To run quichem on Mac OS X or Linux**, or with the wxPython front-end, please download the source from quichem's GitHub page at: github.com/spamalot/quichem.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    t2t-pipe

    automatic alignment pipeline for parallel treebanks

    The *Tree-to-Tree (t2t) Alignment Pipe* is a collection of python scripts, co-ordinating the process of automatic alignment of parallel treebanks from plain text files with a single call from a unix command line. Supported Languages: DE, FR, EN
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    With this tool you can easily design a new language, digitize an existing one or incrementally reconstruct an ancient language. It allows for free experimentation of all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in xml format for easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Easy Equations

    Easy Equations

    Hand Written Equation Creator

    Easy Equations is a utility using which one can write the mathematical equations. The focus of this utility is to provide user friendly access to write mathematical equations which is helpful for students, lecturers, mathematicians and Research persons who prefer using mathematical equations in a document, PowerPoint or web sites. Works on Windows as well as Linux platforms. Software Requirements: JDK 7 or higher. Linux Platform with kernel version 2.7 or higher.(for Linux users).python...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB