Showing 98 open source projects for "python web crawler"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas | Run databases anywhere Icon
    MongoDB Atlas | Run databases anywhere

    Ensure the availability of your data with coverage across AWS, Azure, and GCP on MongoDB Atlas—the multi-cloud database for every enterprise.

    MongoDB Atlas allows you to build and run modern applications across 125+ cloud regions, spanning AWS, Azure, and Google Cloud. Its multi-cloud clusters enable seamless data distribution and automated failover between cloud providers, ensuring high availability and flexibility without added complexity.
    Learn More
  • 1
    miniblink49

    miniblink49

    Lighter, faster browser kernel of blink to integrate HTML UI in apps

    ... electron). Customize as you wish, simulate another browser environment. Perfect HTML5 support, friendly to various front-end libraries (support HTML5, and friendly to front framework). After turning off the cross-domain switch, you can use various cross-domain functions (support cross-domain). Headless mode, which greatly saves resources and is suitable for crawlers (headless mode, be suitable for Web Crawler).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    FreeTAKServer

    FreeTAKServer

    Situational Awareness Server compatible with TAK clients

    FTS is a Python3 implementation of a TAK Server for devices like ATAK, WinTAK, and ITAK, it is cross-platform and runs from a multi-node installation on AWS down to the Android edition. It's free and open source (released under the Eclipse Public License. FTS allows you to connect ATAK clients to share geo-information, to chat with all the connected clients, exchange files and more. It intends to support all the major use cases of the original TAK server.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    FastHTML

    FastHTML

    The fastest way to create an HTML app

    Built on solid web foundations, not the latest fads - with FastHTML you can get started on anything from simple dashboards to scalable web applications in minutes.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    Memvid

    Memvid

    Video-based AI memory library. Store millions of text chunks in MP4

    Memvid encodes text chunks as QR codes within MP4 frames to build a portable “video memory” for AI systems. This innovative approach uses standard video containers and offers millisecond-level semantic search across large corpora with dramatically less storage than vector DBs. It's self-contained—no DB needed—and supports features like PDF indexing, chat integration, and cloud dashboards.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Your top-rated shield against malware and online scams | Avast Free Antivirus Icon
    Your top-rated shield against malware and online scams | Avast Free Antivirus

    Browse and email in peace, supported by clever AI

    Our antivirus software scans for security and performance issues and helps you to fix them instantly. It also protects you in real time by analyzing unknown files before they reach your desktop PC or laptop — all for free.
    Free Download
  • 5
    openvpn-monitor

    openvpn-monitor

    openvpn-monitor is a web based OpenVPN monitor

    openvpn-monitor is a simple Python program to generate HTML that displays the status of an OpenVPN server, including all current connections. It uses the OpenVPN management console. It typically runs on the same host as the OpenVPN server, however, it does not necessarily need to. OpenVPN-monitor is a web-based OpenVPN monitor, that shows current connection information, such as users, location, and data transferred.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Lexbor

    Lexbor

    Lexbor is development of an open source HTML Renderer library

    Lexbor is the development of a web browser engine available as a software library; it ships with a free license and has no extra dependencies. For us, speed is an absolute must-have. In our development process, we focus on fastest parsing techniques for HTML, CSS, and fonts, fastest data processing methods, and fastest ways to serve content to end users. Whether you are building a backend that handles millions of HTML documents or a UI-heavy user app, your software’s response rate always...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    uvicorn-gunicorn-fastapi

    uvicorn-gunicorn-fastapi

    Docker image with Uvicorn managed by Gunicorn

    Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python with performance auto-tuning. Optionally with Alpine Linux.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Connexion

    Connexion

    Swagger/OpenAPI First framework for Python on top of Flask

    Connexion is a framework on top of Flask that automagically handles HTTP requests defined using OpenAPI (formerly known as Swagger), supporting both v2.0 and v3.0 of the specification. Connexion allows you to write these specifications, then maps the endpoints to your Python functions. This is what makes it unique from other tools that generate the specification based on your Python code. You are free to describe your REST API with as much detail as you want and then Connexion guarantees...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    granary

    granary

    The social web translator

    The social web translator. Fetches and converts data between social networks, HTML and JSON with microformats2, ActivityStreams/ActivityPub, Atom, JSON Feed, and more. Granary is a library and REST API that fetches and converts between a wide variety of social data sources and formats. Free yourself from silo API chaff and expose the sweet social data foodstuff inside in standard formats and protocols.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    SAFRS

    SAFRS

    SqlAlchemy Flask-Restful Swagger Json:API OpenAPI

    SAFRS exposes SQLAlchemy database models as a JSON:API web service and generates the corresponding swagger/OpenAPI. Database objects such as the User class from the demo.py example can be extended to include relationships with other objects. The demo_relationship.py contains the following extension of the User class where a relationship with the Book class is implemented.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    CWhy

    CWhy

    Explains and suggests fixes for compile-time errors for C, C++, C#, Go

    Explains and suggests fixes for compiler error messages for a wide range of programming languages, including C, C++, C#, Go, Java, LaTeX, PHP, Python, Ruby, Rust, Swift, and TypeScript. CWhy needs to be connected to an OpenAI account or an Amazon Web Services account. Your account will need to have a positive balance for this to work (check your OpenAI balance). CWhy currently defaults to GPT-4, and falls back to GPT-3.5-turbo if a request error occurs. For the newest and best model (GPT-4...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    pywebview

    pywebview

    Build GUI for your Python program with JavaScript, HTML, and CSS

    pywebview is a lightweight cross-platform wrapper around a webview component that allows to display HTML content in its own native GUI window. It gives you power of web technologies in your desktop application, hiding the fact that GUI is browser based. You can use pywebview either with a lightweight web framework like Flask or Bottle or on its own with a two way bridge between Python and DOM. pywebview uses native GUI for creating a web component window: WinForms on Windows, Cocoa on macOS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    JS Beautifier

    JS Beautifier

    Beautifier for javascript

    ...-beautify script. As with the Python script, the beautified result is sent to stdout unless otherwise configured. You can also use js-beautify as a node library (install locally, the npm default). The beautifier can be added on your page as web library. JS Beautifier is hosted on two CDN services: cdnjs and rawgit. You can beautify javascript using JS Beautifier in your web browser, or on the command-line using node.js or python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    HTTPie CLI

    HTTPie CLI

    Modern, user-friendly command-line HTTP client for the API era

    HTTPie (pronounced aitch-tee-tee-pie) is a command-line HTTP client. Its goal is to make CLI interaction with web services as human-friendly as possible. HTTPie is designed for testing, debugging, and generally interacting with APIs & HTTP servers. The HTTP & HTTPS commands allow for creating and sending arbitrary HTTP requests. They use simple and natural syntax and provide formatted and colorized output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Framework Benchmarks

    Framework Benchmarks

    Source for the TechEmpower Framework Benchmarks project

    If you're new to the project, welcome! Please feel free to ask questions here. We encourage new frameworks and contributors to ask questions. We're here to help! This project provides representative performance measures across a wide field of web application frameworks. With much help from the community, coverage is quite broad and we are happy to broaden it further with contributions. The project presently includes frameworks on many languages including Go, Python, Java, Ruby, PHP, C#, F...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    CssSelector Component

    CssSelector Component

    Converts CSS selectors to XPath expressions

    ... to an XPath equivalent. This XPath expression can then be used with other functions and classes that use XPath to find elements in a document. Not all CSS selectors can be converted to XPath equivalents. There are several CSS selectors that only make sense in the context of a web-browser. Pseudo-elements (:before, :after, :first-line, :first-letter) are not supported because they select portions of text rather than elements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    WhakerPy

    WhakerPy

    Whakerpy - A light web application framework

    Whakerpy is a simple library useful to create dynamic HTML content; it's a light web application framework. Create and manipulate HTML from the power of Python: - Easy to learn. Consistent, simple syntax. - Flexible and easy usage. - Create HTML pages dynamically - Can save as static files, and/or - Run locally with its httpd server and response "bakery" system. Access the documentation: <https://whakerpy.sourceforge.io>.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 18 This Week
    Last Update:
    See Project
  • 19
    Bots-ediint open source edi translator

    Bots-ediint open source edi translator

    open source edi translator

    Bots-ediint is a file translator for EDI (Electronic Data Interchange). Translate files and databases data format to another one: edifact, x12, tradacoms, xml, json, raw, database (postgres, sqlite, mysql, oracle, ...) and more. Bots-ediint is based on Bots https://bots.sourceforge.net Support Python 2 and Python 3 AS2 communications are carried out with my external software that could work in standalone and/or in a Bots channel to send and receive AS2 messages (SYNC and ASYNC...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 20
    stml

    stml

    Indentation Procedure in HTML

    Functionalities of the STML * Indentation Procedure in HTML * Indentation Procedure in HTML * Better Implementation for Python Coders & others * Reduce usage of closing tags
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    pdf-editor

    pdf-editor

    Edit your PDFs without needing a subscription or creating accounts

    Edit your PDFs without needing a subscription or creating accounts. Add a GUI/Turn it into a web application. Add a parser for the command line to do multiple commands at once e.g. merge (cut pdf1) pdf2. Tested working with Python 3.8.5. Install venv (py -3.8 -m pip install virtualenv). PDF and Word documents are binary files, which makes them much more complex than plaintext files. In addition to text, they store lots of font, color, and layout information. If you want your programs to read...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    ADFILT

    ADFILT

    Web filter lists for countless different topics

    This is the place where I, Imre Kristoffer Eilertsen, host my web filter lists for countless different topics, for use in adblock tools and the like. GitHub was in mid-2017 by far the easiest way for laymen like me to store pure text files, which is a necessity to create subscribable lists. This is a hobby project of mine, in which I work just as much on these lists and this repo as I feel like. But don't be fooled by the appearance, as these are nevertheless some lists that I've placed lots...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without loosing any data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Pytago

    Pytago

    A source-to-source transpiler for Python to Go translation

    pytago is a source-to-source transpiler that converts some Python into human-readable Go. It enables developers to translate Python codebases into Go, facilitating migration or interoperability between the two languages. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    I Heart LA

    I Heart LA

    Compilable markdown for linear algebra

    I Heart LA is a compilable markdown for math. It can generate working code in your favorite language (C++, Python, MATLAB, more to come) and LaTeX from snippets.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.