Search Results for "information extraction"

Showing 192 open source projects for "information extraction"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 1
    RedtDec

    RedtDec

    RetDec is a retargetable machine-code decompiler based on LLVM

    ... graphs, and various statistics. Detection and reconstruction of C++ class hierarchies (RTTI, vtables). Signature-based removal of statically linked library code. Static analysis of executable files with detailed information. We currently support Windows (7 or later), Linux, macOS, and (experimentally) FreeBSD. An installed version of RetDec requires approximately 5 to 6 GB of free disk space.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 2
    audioFlux

    audioFlux

    A library for audio and music analysis, feature extraction

    A library for audio and music analysis, and feature extraction. Can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training and is used...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ... and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    GROBID

    GROBID

    A machine learning software for extracting information

    ... here covers the usual bibliographical information (e.g. title, abstract, authors, affiliations, keywords, etc.). References extraction and parsing from articles in PDF format, around .87 F1-score against on an independent PubMed Central set of 1943 PDF containing 90,125 references, and around .89 on a similar bioRxiv set of 2000 PDF (using the Deep Learning citation model). All the usual publication metadata are covered (including DOI, PMID, etc.).
    Downloads: 1 This Week
    Last Update:
    See Project
  • Deliver secure remote access with OpenVPN. Icon
    Deliver secure remote access with OpenVPN.

    Trusted by nearly 20,000 customers worldwide, and all major cloud providers.

    OpenVPN's products provide scalable, secure remote access — giving complete freedom to your employees to work outside the office while securely accessing SaaS, the internet, and company resources.
    Get started — no credit card required.
  • 5
    DeepKE

    DeepKE

    An Open Toolkit for Knowledge Graph Extraction and Construction

    Supporting cnSchema, standard supervised setting, low-resource setting, document-level setting and multi-modal setting for knowledge base population. DeepKE is a knowledge extraction toolkit supporting cnSchema, standard supervised, low-resource, and document-level scenarios for entity, relation, and attribution extraction. It allows developers and researchers to customize datasets and models to extract information from unstructured texts. DeepKE supports low-resource settings with only a few...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MMOCR

    MMOCR

    OpenMMLab Text Detection, Recognition and Understanding Toolbox

    MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks including key information extraction. It is part of the OpenMMLab project. The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction. The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction. The modular...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Simd Library

    Simd Library

    C++ image processing and machine learning library with using of SIMD

    The Simd Library is a free open-source image processing and machine learning library, designed for C and C++ programmers. It provides many useful high-performance algorithms for image processing such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection and classification, neural networks. The algorithms are optimized with using of different SIMD CPU extensions. In particular, the library supports...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    EMBA

    EMBA

    The firmware security analyzer

    EMBA is designed as the central firmware analysis tool for penetration testers and product security teams. It supports the complete security analysis process starting with firmware extraction, doing static analysis and dynamic analysis via emulation and finally generating a web report. EMBA automatically discovers possible weak spots and vulnerabilities in firmware. Examples are insecure binaries, old and outdated software components, potentially vulnerable scripts, or hard-coded passwords...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Trafilatura

    Trafilatura

    Python & command-line tool to gather text on the Web

    Trafilatura is a Python package and command-line tool designed to gather text on the Web. It includes discovery, extraction and text-processing components. Its main applications are web crawling, downloads, scraping, and extraction of main texts, metadata and comments. It aims at staying handy and modular: no database is required, the output can be converted to various commonly used formats. Going from raw HTML to essential parts can alleviate many problems related to text quality, first...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Free CRM Software With Something for Everyone Icon
    Free CRM Software With Something for Everyone

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    Think CRM software is just about contact management? Think again. HubSpot CRM has free tools for everyone on your team, and it’s 100% free. Here’s how our free CRM solution makes your job easier.
    Get free CRM
  • 10
    spacy-llm

    spacy-llm

    Integrating LLMs into structured NLP pipelines

    Large Language Models (LLMs) feature powerful natural language understanding capabilities. With only a few (and sometimes no) examples, an LLM can be prompted to perform custom NLP tasks such as text categorization, named entity recognition, coreference resolution, information extraction and more. This package integrates Large Language Models (LLMs) into spaCy, featuring a modular system for fast prototyping and prompting, and turning unstructured responses into robust outputs for various NLP...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    react-docgen

    react-docgen

    A CLI and toolbox to extract information from React component files

    react-docgen is a CLI and toolbox to help extracting information from React components, and generate documentation from it. It uses @babel/parser to parse the source into an AST and provides methods to process this AST to extract the desired information. The output / return value is a JSON blob / JavaScript object. It provides a default implementation for React components defined via React.createClass, ES2015 class definitions or functions (stateless components). These component definitions...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SageMaker Experiments Python SDK

    SageMaker Experiments Python SDK

    Experiment tracking and metric logging for Amazon SageMaker notebooks

    Experiment tracking in SageMaker Training Jobs, Processing Jobs, and Notebooks. SageMaker Experiments is an AWS service for tracking machine learning Experiments. The SageMaker Experiments Python SDK is a high-level interface to this service that helps you track Experiment information using Python. Experiment tracking powers the machine learning integrated development environment Amazon SageMaker Studio. Experiment: A collection of related Trials. Add Trials to an Experiment that you wish...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    spaCy models

    spaCy models

    Models for the spaCy Natural Language Processing (NLP) library

    spaCy is designed to help you do real work, to build real products, or gather real insights. The library respects your time, and tries to avoid wasting it. It's easy to install, and its API is simple and productive. spaCy excels at large-scale information extraction tasks. It's written from the ground up in carefully memory-managed Cython. If your application needs to process entire web dumps, spaCy is the library you want to be using. Since its release in 2015, spaCy has become an industry...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Transformers

    Transformers

    State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX

    Transformers provides APIs and tools to easily download and train state-of-the-art pre-trained models. Using pre-trained models can reduce your compute costs, carbon footprint, and save you the time and resources required to train a model from scratch. These models support common tasks in different modalities. Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder, that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. You can change...
    Leader badge
    Downloads: 1,517 This Week
    Last Update:
    See Project
  • 16
    VideoSubFinder
    The main purpose of this program is to provide functionality for extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) on video with saving info about timing positions. 2) Generation of cleared from background text images, which allows with usage of OCR programs (like FineReader, Subtitle Edit, Google Drive) to generate complete subtitles with original text and timing. For working of this program on...
    Leader badge
    Downloads: 350 This Week
    Last Update:
    See Project
  • 17
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender...
    Leader badge
    Downloads: 74 This Week
    Last Update:
    See Project
  • 18
    Zip-Ada

    Zip-Ada

    A standalone, portable Ada library for .zip archives

    Zip-Ada is a library for .zip archives. Full sources are in Ada and are unconditionally portable. Input and output can be any stream (file, buffer,...) for archive creation as well as data extraction. Task safe and endian-neutral. More information on: http://unzip-ada.sf.net Alire crate: https://alire.ada.dev/crates/zipada Mirror: https://github.com/zertovitch/zip-ada/
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • 19
    Honeyview

    Honeyview

    Fast and lightweight image viewer

    Honeyview is a fast and lightweight image viewer that supports a wide range of image formats, including popular ones like JPG, PNG, and BMP, as well as less common formats such as PSD and RAW. Designed for users who need an efficient tool for viewing and organizing images, Honeyview provides a clean and intuitive interface with high-speed performance. It also supports viewing images within compressed archives, such as ZIP or RAR files, without the need for extraction. Additionally, Honeyview...
    Downloads: 49 This Week
    Last Update:
    See Project
  • 20
    LabPlot

    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 21
    Ominis-Osint

    Ominis-Osint

    Ominis OSINT: Secure Web-history Search

    Ominis OSINT Secure Web-history Search stands as a premier solution for digital reconnaissance, offering unparalleled capabilities in gathering, analyzing, and interpreting publicly available information sourced from diverse online platforms. With its comprehensive suite of features, Ominis equips users to navigate through the expansive digital landscape with precision and efficiency, enabling the extraction of valuable insights from a myriad of sources.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    p3d

    p3d

    General data-reduction tool for fiber-fed integral-field spectrographs

    p3d is a general data-reduction tool for use with fiber-fed integral-field spectrographs (IFSs); although, the spectrum viewer works with spectrum data cubes of any origin. The tool is built about the proprietary software IDL (Harris/EXELIS; see http://www.harrisgeospatial.com), but can be used without any license. Most slowly running loops are implemented in parallelized C (OpenMP).
    Downloads: 10 This Week
    Last Update:
    See Project
  • 23
    Tensorflow Transformers

    Tensorflow Transformers

    State of the art faster Transformer with Tensorflow 2.0

    Imagine auto-regressive generation to be 90x faster. tf-transformers (Tensorflow Transformers) is designed to harness the full power of Tensorflow 2, designed specifically for Transformer based architecture. These models can be applied on text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. Images, for tasks like image classification, object detection, and segmentation. Audio, for tasks like...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Image16

    Image16

    Image viewer/editor specialized in astrophotography 16bits processing

    Opens/saves TIF, FITS, PNG, XISF, PXM, RAW, JPG, XCF, PSD. AsinH, Sqrt, Ln, Exp and Sqr stretching, HSL and RGB adjusting, Levels and Curves adjusting and Background Extraction for Color Balance, Flatness and Gradient Correction. Arithmetic between images. Resizing and croping. Color Balance based on background, selection or stars B-V (after platesolving). Debayering/Bayering. Inversion, mirroring, rotation, barrel/pincushion correction. RGB alignment/shifting. Erode, Dilate, ... Blurring...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing in C++17/20

    DocWire SDK, a standout C++17/20 data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. The upcoming integration of C++17 and C++20 will bring advanced functionalities, particularly in areas like HTTP capabilities and web data extraction. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next