Showing 55 open source projects for "python web crawler"

  • 1
    PULSAR

    Distributed pub-sub messaging system

    ... durability guarantees. Configurable replication between data centers across multiple geographic regions. Built from the ground up as a multi-tenant system. Supports isolation, authentication, authorization and quotas. Persistent message storage based on Apache BookKeeper. IO-level isolation between write and read operations. Flexible messaging models with high-level APIs for Java, Go, Python, C++, Node.js, WebSocket and C#.
    Downloads: 1 This Week
  • 2
    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    ... to various endpoints in real time. Agile development experience with SQL-like query language and graphical drag-and-drop editor supporting event simulation. Lightweight runtime that can natively run on Kubernetes, Docker, VM, or bare metal, and embedded in any Java or Python application. Scalable, and highly available distributed event processing on Kubernetes, with NATS Streaming and Siddhi Kubernetes Operator.
    Downloads: 0 This Week
  • 3
    HEALPix

    Data Analysis, Simulations and Visualization on the Sphere

    Software for pixelization, hierarchical indexation, synthesis, analysis, and visualization of data on the sphere. Please acknowledge HEALPix by quoting the web page http://healpix.sourceforge.net (or https://healpix.sourceforge.io) and the publication K.M. Gorski et al., 2005, Ap.J., 622, p.759. Full software documentation is available at https://healpix.sourceforge.io/documentation.php. Wiki pages: https://sourceforge.net/p/healpix/wiki/Home. Exchanging data with HEALPix (in FITS files): https...
    Downloads: 586 This Week
  • 4
    This is a C library for checking the validity of German and Austrian bank account numbers. All test methods currently defined by Deutsche Bundesbank (Dec 2017: 00 to E4) are implemented. Modules for AWK, Perl, PHP, Python, Ruby, C#.net and VB.net are also included. The package also includes an IBAN converter that generates (German) IBANs and BICs from account data. All IBAN rules currently defined by Deutsche Bundesbank (Dec 2017: 57 rules) are implemented and tested against independent solutions.
    Downloads: 5 This Week
  • 5
    Autoplot

    Autoplot is an interactive browser for data on the web

    Autoplot is an interactive browser for data on the web. Give Autoplot a URL or local file name and it creates a sensible plot of the data. Autoplot allows you to interactively browse data stored in ASCII, .cdf, NetCDF, and many other formats. Autoplot's source has been moved to GitHub. Thanks to SourceForge for many years of hosting!
    Downloads: 0 This Week
  • 6
    opsi

    open source client management system, software deployment

    opsi is a client management system for Windows and Linux clients, based on Linux servers. It is used for automatic OS installation, software deployment, hardware and software inventory, and distributed software depots.
    Downloads: 7 This Week
  • 7
    rocket-bi

    An open-source web-based self-service BI for analytical databases

    Rocket.BI is a free, open-source, web-based business intelligence solution specifically designed for analytical databases. It enables data analysts and business users alike to easily integrate different data sources, perform advanced data analysis, ad hoc, and more. With an easy-to-use editor, you can create personalized reports, build interactive business dashboards and generate actionable business insights. Rocket.BI also allows collaboration as working together with other people...
    Downloads: 0 This Week
  • 8
    proreports

    Simple Reporting System

    ... or Python). This gives the user great flexibility, because the format imposes almost no limits; that is the true power of this simple system. Custom plug-ins written in PHP5 or Java can be attached to the system. It is a framework for programming reports (and also a good tool for building REST API microservices) rather than a drag-and-drop tool.
    Downloads: 0 This Week
  • 9
    OpenEMM e-mail & marketing automation

    software for email automation (newsletters, transaction mails, etc.)

    This is an archive for old versions of OpenEMM. You will find the latest version of OpenEMM here: https://wiki.openemm.org OpenEMM is a browser-based enterprise application for email automation like info and marketing newsletters, transaction mails or multi-stage email campaigns. OpenEMM offers tons of features for professional users, among them: a great user interface, template-based HTML mailings, automatic bounce management, mail opening and link tracking, lots of graphical realtime...
    Downloads: 12 This Week
  • 10
    Liferay Portal

    The world's leading open source portal

    Liferay Portal is the world's leading enterprise open source portal framework, offering integrated Web publishing and content management, an enterprise service bus and service-oriented architecture, and compatibility with all major IT infrastructure. Check GitHub for our latest releases: https://github.com/liferay/liferay-portal/releases https://github.com/liferay/liferay-ide/releases
    Downloads: 337 This Week
  • 11
    CloverDX

    Design, automate, operate and publish data pipelines at scale

    .... Simple data manipulation jobs can be created visually. More complex business logic can be implemented using Clover's domain-specific language CTL, in Java, or in languages like Python or JavaScript. Through its DataServices functionality, it lets you quickly turn data pipelines into REST API endpoints. The platform lets you easily scale your data jobs across multiple cores or nodes/machines. It supports Docker/Kubernetes deployments and offers AWS/Azure images in their respective marketplaces.
    Downloads: 7 This Week
  • 12
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    OpenSearchServer is a powerful, enterprise-class search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST API, Ruby, Rails, Node.js, PHP, Perl), you can quickly and easily integrate advanced full-text search capabilities into your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows...
    Downloads: 13 This Week
  • 13
    Kingpin

    NEXT GENERATION BUSINESS PROCESS AND DOCUMENT MANAGEMENT SYSTEM

    KINGPIN - A New Paradigm Of Working in Action Connecting People, Process, System and Technology - Kingpin can understand all forms of collaborative tasks, interact naturally with team and systems and people at scale. Say goodbye to silo solutions. Make your work effortless and efficient with a single solution on connected architecture that takes care of all your management needs. Kingpin facilitates communication between different departments, external vendors and service providers to...
    Downloads: 0 This Week
  • 14
    DSTK - DataScience ToolKit

    DSTK - DataScience ToolKit for All of Us

    ... JASP for advanced data editing and RapidMiner for advanced prediction modeling. DSTK is written in C#, Java and Python to interface with R, NLTK, and Weka. It can be expanded with plugins using R scripts. We have also created plugins for more statistical functions, and for Big Data analytics with Microsoft Azure HDInsight (Spark Server) with Livy. License: R, RStudio, NLTK, SciPy, SKLearn, MatPlotLib, Weka, ... each have their own licenses.
    Downloads: 0 This Week
  • 15
    Platform Course

    Open source framework for creating business web applications

    Platform Course 5.0.0 is an open source framework for easily developing custom solutions for unique business processes. Main advantages: cross-domain auth, LDAP integration, cross-browser GWT-based UI, big table handling, charts and geo maps, input forms based on XForms. Tested with MSSQL, PostgreSQL and Oracle. The current SVN repository for this product is located at https://share.curs.ru/svn/showcase/branches/stable/ Login: reader Password: reader If you wish to know further...
    Downloads: 0 This Week
  • 16
    COAR-DMS

    DMS for Linux: C++ library, server, web UI, SOAP

    ... security (rwx), special authorities; from thousands to tens of billions of documents; dashboard (working copies, new documents); electronic signatures; search statements with SQL-like syntax; multithreaded, multiprocess library. Servers: native HTTP server (libmicrohttp); SOAP server; WebDAV (planned); indexer. Python API. Web UI: GWT, JSP, SOAP API.
    Downloads: 0 This Week
  • 17
    scipion-xmipp

    Image processing framework to integrate EM software packages.

    Scipion is an image processing framework for obtaining 3D models of macromolecular complexes using Electron Microscopy (3DEM). It integrates several software packages and presents a unified interface for both biologists and developers. Scipion lets users execute workflows that combine different software tools, while taking care of formats and conversions. Additionally, all steps are tracked and can be reproduced later on. Xmipp is a well-known package in EM image processing. It is integrated...
    Downloads: 0 This Week
  • 18
    Debian Unstable/testing alpha.

    Pre-Alpha Console distro for high performance Linux computing.

    Basic Debian Intel/AMD HPC console/desktop, multi-purpose. Linux for software devs/QA and thin laboratory clusters. MIC hardware, NUMA, live network memory, and full IPv6 are supported. Final alpha release features: USB image; general Debian repos; VM optimizations; hugepages by default; cgroups optimized by default; ALSA; latest debugging/bare-metal kernels; full Haswell and MIC support in test; fully optimized Python support; Full...
    Downloads: 0 This Week
  • 19
    webStraktor is a programmable World Wide Web data extraction client. Its purpose is to scrape HTML-based content over HTTP and extract relevant information. webStraktor features a scripting language to facilitate the collection, extraction, and storage of information available on the web, including images. The scripting language uses elements of regular expression and XPath syntax. It has a small instruction set and its syntax is easy...
    Downloads: 0 This Week
  • 20
    PengERP is based on OpenERP, but more powerful.
    Downloads: 0 This Week
  • 21
    Portable Linux

    Portable Ubuntu Linux for Scientific Computing

    Released August 22, 2013. Lubuntu Blends: Biochemistry 13.04 (Raring) v5.44, Linux Kernel Image 3.8.0-29. Lubuntu Blends are pre-installed Wubi disk image remixes of Ubuntu and Debian Science meta packages. A custom boot loader allows installations to be copied and automatically booted from most external or USB flash drives. Once up and running, use the earlier Lubuntu Remix README instructions here until documentation is updated....
    Downloads: 1 This Week
  • 22
    CC2

    Cyclos 3.7 port with improved UI and additional reporting app

    CC2 is a customised, repackaged version of Cyclos 3.7 designed to: allow small communities to get up and running quickly with the world’s leading alternative currency software; improve the look and feel of Cyclos and make it more intuitive for users; and add reporting functionality (x and z reporting, automated invoicing) for business users. Please see the Release Note on the Wiki tab for the background of this project and non-technical aspects of setting up...
    Downloads: 1 This Week
  • 23
    Constellio Enterprise Search engine

    Open source Search Engine and Enterprise Search

    Constellio is an enterprise search engine that allows companies to search all their organization's information through a single interface (Web, CRM, ERP, ECM, mail, etc.). Constellio is based on Apache Solr and Google Search Appliance's connector, and it has a powerful web crawler.
    Downloads: 2 This Week
  • 24
    Screenshot Paste plugin for Trac

    A Trac plugin to allow pasting screenshots or images with one click

    A Trac plugin that allows pasting screenshots or other images captured or copied to the clipboard directly as attachments to tickets, Wiki pages, etc., without first saving them as images and then uploading them. Once the plugin is installed in Trac, you can attach a screenshot or any image in the clipboard to a ticket or Wiki page with one click.
    Downloads: 0 This Week
  • 25
    RiverGlass EssentialScanner is an open source web and file system crawler which indexes the text content of discovered files so they can be retrieved and analyzed. It provides simple scanner capabilities as part of larger enterprise search solutions.
    Downloads: 0 This Week