Showing 137 open source projects for "beta"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
    Get started free
  • 1
    Web Spider, Web Crawler, Email Extractor

    Web Spider, Web Crawler, Email Extractor

    Free Extracts Emails, Phones and custom text from Web using JAVA Regex

    In Files there is WebCrawlerMySQL.jar which supports MySql Connection Please follow this link to get latest version https://sourceforge.net/projects/web-spider-web-crawler-extract/ Free Web Spider & Crawler. Extracts Information from Web by parsing millions of pages. Store data into Derby OR MySQL Database and data are not being lost after force closing the spider. - Free Web Spider , Parser, Extractor, Crawler - Extraction of Emails , Phones and Custom Text from Web - Export...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    ImgurDL

    ImgurDL

    Imgur Gallery Downloader

    Users can now search Imgur for any phrase and ImgurDL/Loadur will automatically search for matching images. ImgurDL/Loadur will download the images while displaying the progress to the user.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    YouSeer is an open source search engine framework, which was built on top of other open source components. It’s part of the general SeerSuite framework. YouSeer utilizes Hereitrix as a crawler and solr as an indexing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 5
    Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (file systems, web sites, mail boxes, ...) and the file formats (documents, images, ...) occurring in these systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    WebHarvest - web data extraction tool
    Web data extraction (web data mining, web scraping) tool. It leverages well proved XML and text processing techologies in order to easely extract useful data from arbitrary web pages.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 7

    anzmest

    GeoNetwork for AU/NZ users

    GeoNetwork 2.10.5 with Australia/New Zealand Metadata Profiles and NICTA National Map as map viewer - see screenshots. Beta version of GeoNetwork 3.1.x with Australia/New Zealand Metadata Profiles coming shortly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    This package contains different tools to add NLP capabilities for Lucene 4.x (it has been tested using Lucene version from 4.6.x to 4.8.1). Although it was originally developed for German, it is, mostly, language independent. It allows the user to lemmatize words to be indexed, to weight termy ba their parts of speech (e.g. weighting nouns mor hevaily than pronouns), and to add synonyms taken from GermaNet or a list you provide to the search index and thereby increase recall of lucene.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Bolter v. 1.0.0

    Bolter v. 1.0.0

    Interactive web-search.

    Program for web-search by defenite sites and periods of time. Definition by user. Using: http://www.yandex.ru/, http://www.google.ru. Search achived by redirect search query to search services. In other words Bolter - wrapper of existing search services. Visit http://vk.com/bolter_app for more info.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML based content via the HTTP protocol and extract relevant information. webStraktor features a scripting language to facilitate the collection, the extraction and the storage of information available on the web, including images. The scripting language uses elements of the Regular Expression and xPath syntax. The webStraktor scripting language has a small instruction set and its syntax is easy...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    IRToolkit

    IRToolkit

    IRToolkit

    IRToolkit is an attempt to build and develop a generic search engine that integrates state-of-the-art Information Retrieval (IR) models. Furthermore, it offers a capability to compare the performance (in terms of precision, recall, index size, search response time and so on) between several open source IR applications. If you use the IRToolkit please cite the following work: https://sites.google.com/site/dinhbaduy/bibtex#Dinh-Phdthesis-2012
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Infofuze

    Data migration/conversion library based on STX and XSLT transformation

    Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    QuickFind

    QuickFind

    Instant File Search

    QuickFind is a cross platform Java Application for searching files in your Computer. The sole purpose of this application is to save user time by searching the desired file almost instantly. User has all the freedom to schedule or cache manually any time on selected directories. Once the caching is done all you have to do is input the name of the file you want to find.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    The goal of this project is to provide a reusable library to transform any web page or data to content objects by generic, configurable ContentProvider plugins for the iQser GIN Semantic Middleware (www.iqser.com).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SMW SolrStore
    a Solr Based Semantic Mediawiki Store
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    GUDDI

    e-DSG Descoberta de Serviço Eletrônico Governamental

    GUDDI é uma solução livre desenvolvida com o Framework Demoiselle que implementa o conceito de e-DSG (Descoberta de Serviço Eletrônico Governamental) e segue os padrões do e-PING para auxiliar Entidades Públicas a divulgarem seus serviços.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A web database and bioinformatics analysis tool for gene families involved in antigenic variation and immune system evasion in pathogens.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    CraigsLister

    Search Craigslist

    Searches one or more categories on Craiglist for keywords. Searches may be saved for later use, and may be limited to unseen (or "new") listings only. Results may be optionally sent via email.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    SeerSuite
    SeerSuite is an application toolkit for digital libraries and search engines; i.e., CiteSeerX. CiteSeerX has moved to GitHub, please get the latest code from: https://github.com/SeerLabs/CiteSeerX
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Roosster.org is a personal "on-demand" search engine. This means, it indexes only items/entries/files/URLs you explicitly tell it to index and provides a full-text-search over indexed items.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DoxMentor4J is a standalone cross platform Web/Ajax based documentation library that is fully searchable and may be hosted in the file system, in an archive or embedded in the Java classpath.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
    Downloads: 0 This Week
    Last Update:
    See Project