Showing 26 open source projects for "ofn-extract-objects.py"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby and MySQL Database - Written in Java Cross Platform Also See Free email Sender :...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 32 This Week
    Last Update:
    See Project
  • 3
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export to Excel File - Data Saved into Derby Database - Written in Java Cross Platform See also Free Email Sender in this link: https://sourceforge.net/projects/gitst-free-email-ender/ Please install Microsoft OpenJDK to start the application https://www.microsoft.com/openjdk
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    OTRkeyfinder

    OTRkeyfinder

    Processes the site otrkeyfinder.com.

    ...Or did you ever think "It's stupid, to have to open a web browser to search for the files."? - This is a solution for you. User interface is German. Does not work behind proxies. Extract zipped files with http://www.7-zip.org/ . GERMAN: Wolltest du jemals Otrkeyfinder.com benutzen, wolltest aber keinen Browser-Tab dafür öffnen, oder hast du gedacht "Ich will jetzt aber keinen Browser öffnen, nur um die Seite nach Dateien zu durchsuchen"? - Dieses Programm ist eine Lösung. Funktioniert nicht, wenn Proxys für den Internetzugriff benötigt werden. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    MineSoft Datamine System

    MineSoft Datamine System

    PHP application for datamining

    Application for datamining. Use for good not evil. this isnt totally practical if you are targetting MASS ammounts of websites. its not a bot. each url has to be entered by hand.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy to master. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Automatic categorization of texts based on supplied controlled vocabularies. Is a php tool to extract terms from a text and use it to obtain keywords from a specific controlled vocabulary. Use the terminological web services provided by TemaTres.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    WebExtractor360 is a free and open source web data extractor. It uses Regular Expressions to find, extract and scrape internet data quickly and easily.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ImageCrawler Application to extract Images from Websites. A Thumbnail view is provided. Based on Spring.NET and the HTML Agility Pack
    Downloads: 0 This Week
    Last Update:
    See Project
  • For every need the right PDF solution Icon
    For every need the right PDF solution

    PDFCreator is used in many companies worldwide.

    PDFCreator converts every printable document to PDF and many other formats. Convert your documents to PDF, JPG, PNG, TIF and more. Merge multiple documents to one file. Use automatic saving to have a fully automated PDF printer. Profiles make frequently used settings available with one click. We take care of the complexity and make converting PDFs simple for you.
    Learn More
  • 10
    Law Leecher
    Law Leecher is a multi-threaded web crawling tool which extracts laws from the EU law database PreLex (http://ec.europa.eu/prelex/). It's written in Ruby.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    MuSE-CIR is a Multigram-based Search Engine and Collaborative Information Retrieval system. Written in Java /JSP, supports any JDBC connectable database - thoroughly tested only with OracleXE, and somewhat with MySQL, JSP on Apache Tomcat 5.5
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Questo script consente di evidenziare, estrarre e condividere contenuti da una pagina web tramite la semplice selezione col mouse. This script allows you to highlight, extract and share content from a web page simply by mouse selecting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Spider that recollects data from MySpace Social Network. At now, it is only designed to extract information from native american people because it is used for a social science study in the UNAM (Universidad Nacional Autónoma de México).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A utility to extract meta-information (properties/comments) out of various file-types; e.g. HTML, PDF, RTF & various Office documents; OGG/MP3 files and JPEG/PNG/GIF images, which can be presented in various output formats (HTML, XML, LaTeX & plain t
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Extract is an Web Information Management System which allows users to store and search many kind of structured data in a database (database records, Samba directories and files) classified in categories like in file system browsers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    WebExtractor360 is a free and open source web data extractor. It allows you to extract Images, Phrases, URLs (Links), URLs (Keywords), Emails, Phone, Fax and ANY other information on the web by specifying a Regular Expression. See http://www.webextractor
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project is aimed at extracting keywords from documents either as files or on the Internet. It applies sophisticated keyword ranking algorithm to extract most relevant keywords for a document and has also the capability of finding similar document in
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    TextMine is for the Perl hacker who is grappling with the problems of managing unstructured text from various sources. You can use these text mining tools to search the Web, index text, extract entities, categorize your e-mail, and summarize documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Download multiple job postings in XHTML for batch browsing. Can also be input into programs you write to screen, weight, sort, archive, analyse job requirements etc. Currently supports http://www.jobbank.gc.ca
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    WebNews Crawler is a specific web crawler (spider, fetcher) designed to acquire and clean news articles from RSS and HTML pages. It can do a site specific extraction to extract the actual news content only, filtering out the advertising and other cruft.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    GronoSpy is a WWW crawler which tries to extract knowledge based on the data from grono.net - a community portal.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Java program to extract postings and comments from http://www.livejournal.com (blog) into DB and view/classify/process it. LJ loader. Components to reuse: perl-like, but efficient Web pages scraper, trees analyzer, concurrent scheduler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    JMdRdf is the tool which creates RDF/RSS. 1.You can generate RDF/RSS about your homepage from your HTML(s) without programming. JMdRdf extract Information such as title, description, etc automatically from HTML. 2.You can paste RDF/RSS into your HTML
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Command line HTML Parser to be used in scripts to extract data from HTML/webpage according to supplied path and options. Usefull for systematic periodic parsing pages with known structures where information keeps changing - like looking for item on ebay
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    using the vast TV show database of tvtome.com, this library will provide functinoality to extract any informations from their web pages through use of regular expressions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next