Showing 24 open source projects for "documents"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Learn More
  • 1
    PDFMathTranslate

    PDFMathTranslate

    PDF scientific paper translation with preserved formats

    PDFMathTranslate is a Python-based tool that uses AI translation to convert academic PDFs into bilingual (e.g. Chinese-English) documents while preserving formatting, including math notation. It supports OCR-enhanced content and offers CLI, GUI, Docker, and Zotero integration under AGPL v3.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    Handcalcs

    Handcalcs

    Python library for converting Python calculations into rendered latex

    Handcalcs is a Python library that auto-renders calculation code in Jupyter notebooks or LaTeX documents with step-by-step symbolic substitution, giving output a “handwritten” feel. It supports cell magics and auto-LaTeX generation via configurable output options.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    PaperQA2

    PaperQA2

    High accuracy RAG for answering questions from scientific documents

    PaperQA2 is a package for doing high-accuracy retrieval augmented generation (RAG) on PDFs or text files, with a focus on the scientific literature. See our recent 2024 paper to see examples of PaperQA2's superhuman performance in scientific tasks like question answering, summarization, and contradiction detection. In this example we take a folder of research paper PDFs, magically get their metadata - including citation counts and a retraction check, then parse and cache PDFs into a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4

    pyLogos

    Qualitative content analysis software.

    pyLogos is a program to support text content analysis. Documents (imported from txt and docx files) are stored in a database, and may have marked text segments associated with codes. It is possible to retrieve these segments in various ways, generate word clouds, tabulate frequency of codes and words, among other outputs. pyLogos é um programa de apoio à análise de conteúdo de textos. Documentos (importados de arquivos txt e docx) são armazenados numa base de dados, podendo ter segmentos de textos marcados a associados a códigos. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Top Corporate LMS for Training | Best Learning Management Software Icon
    Top Corporate LMS for Training | Best Learning Management Software

    Deliver and Track Online Training and Stay Compliant - with Axis LMS!

    Axis LMS enables you to deliver online and virtual learning and training through a scalable, easy-to-use LMS that is designed to enhance your training, automate your workflows, engage your learners and keep you compliant.
    Learn More
  • 5
    PySchool

    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Leader badge
    Downloads: 1,453 This Week
    Last Update:
    See Project
  • 6

    MITRE Annotation Toolkit

    A toolkit for managing and manipulating text annotations

    The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g., named entity identification, de-identification of medical records). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LP CSIC/UAB Apps and Code

    LP CSIC/UAB Apps and Code

    Software and Code from Laboratori de Proteòmica CSIC/UAB

    Software, Code and Documents from Laboratori de Proteòmica CSIC/UAB ( LP-CSIC/UAB: http://proteomica.uab.cat )
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Safe Harbor Deidentification

    Safe Harbor Deidentification for medical documents

    Phalanx - Deidentify Safe Harbor Deidentification Mode of Phalanx is an abridged pipeline of NLP annotators culminating in NER annotators which write output of text offsets. It uses the Safe Harbor deidentification method.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Arabic Corpus

    Text categorization, arabic language processing, language modeling

    The Arabic Corpus {compiled by Dr. Mourad Abbas ( http://sites.google.com/site/mouradabbas9/corpora ) The corpus Khaleej-2004 contains 5690 documents. It is divided to 4 topics (categories). The corpus Watan-2004 contains 20291 documents organized in 6 topics (categories). Researchers who use these two corpora would mention the two main references: (1) For Watan-2004 corpus ---------------------- M. Abbas, K. Smaili, D. Berkani, (2011) Evaluation of Topic Identification Methods on Arabic Corpora,JOURNAL OF DIGITAL INFORMATION MANAGEMENT,vol. 9, N. 5, pp.185-192. 2) For Khaleej-2004 corpus --------------------------------- M. ...
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • Reach Your Audience with Rise Vision, the #1 Cloud Digital Signage Software Solution Icon
    Reach Your Audience with Rise Vision, the #1 Cloud Digital Signage Software Solution

    K-12 Schools, Higher Education, Businesses, Restaurants

    Rise Vision is the #1 digital signage company, offering easy-to-use cloud digital signage software compatible with any player across multiple screens. Forget about static displays. Save time and boost sales with 500+ customizable content templates for your screens. If you ever need help, get free training and exceptionally fast support.
    Learn More
  • 10

    SCons gnuplot tool

    SCons tool to cooperate with gnuplot program

    ...This package is an extension (tool) to SCons, which enables "compilation" of gnuplot scripts in SCons-based projects. The intended audience include (but is not limited to) authors/publishers of LaTeX documents, scientific comunity and authors of the projects relying on automatic generation of plots.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. ...
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    Create, check, annotate, merge, diff, split SBML (System Biology Markup Lanugage) documents. The latest version of semanticSBML is webbased with a RESTful interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    ...It allows for free experimentation of all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in xml format for easy integration with other software.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14

    Planets

    A program for approximate simulation of planetary systems

    ...This program can be used in schools for illustration and students can experiment with it themselves. I created this project as part of the German Abitur exam, which is why the help documents are currently only available in German. Requirements: - Python 2.7 - Windows 7, Windows XP, Ubuntu Other operating systems may work but were not tested.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Basic Experimenter
    Wizard-like software to run experimental procedures (at large). Each step of the procedure is a file (document, on-line form, checklist, video, program... ). Experimental data and documents are backuped together. NEW: templates of procedures
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    PyMemorial is a tool to generate cadastral survey documentation using brazilian's INCRA standards. Completely written in Python it aims to be a simple way to produce all documents following the most current legislation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Goldify is a set of tools that allow automated addition of links into electronic documents. Its main purpose is to allow such addition of links into documents that wish to link to the IUPAC GoldBook (http://goldbook.iupac.org).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    calioPY (keh-li'-oh-pie) is Python software that lets you visualize, organize, publish and collaborate on calculations, clearly and simply. It runs in the Leo organizer and knows how to format text, equations, tables, plots, figures, and merge documents
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Crow - Computational Representation Of Whatever. A platform for the integration and mining of complex and distributed data. Represents cross-linked semantic web documents as a network of software objects and offers easy ways to filter, and sort them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Universal information crawler is a fast precise and reliable Internet crawler. Uicrawler is a program/automated script which browses the World Wide Web in a methodical, automated manner and creates the index of documents that it accesses.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Mainly a browser for medical patients documents. For now, it works for displaying radiology images in DICOM format. It supports Query/Retrieve. Build on Python, wxPython and ZODB.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Comparer is a document comparer program. The initial task of the comparer is to find the likeness of all documents given to it by generating a graph of all usable connections between documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next