22 projects for "python web crawler" with 2 filters applied:

  • Simply solve complex auth. Easy for devs to set up. Easy for non-devs to use. Icon
    Simply solve complex auth. Easy for devs to set up. Easy for non-devs to use.

    Transform user access with Frontegg CIAM: login box, SSO, MFA, multi-tenancy, and 99.99% uptime.

    Custom auth drains 25% of dev time and risks 62% more breaches, stalling enterprise deals. Frontegg platform delivers a simple login box, seamless authentication (SSO, MFA, passwordless), robust multi-tenancy, and a customizable Admin Portal. Integrate fast with the React SDK, meet compliance needs, and focus on innovation.
    Start for Free
  • Get the most trusted enterprise browser Icon
    Get the most trusted enterprise browser

    Advanced built-in security helps IT prevent breaches before they happen

    Defend against security incidents with Chrome Enterprise. Create customizable controls, manage extensions and set proactive alerts to keep your data and employees protected without slowing down productivity.
    Download Chrome
  • 1
    wxMEdit

    wxMEdit

    wxMEdit, Cross-platform Text/Hex Editor, Improved Version of MadEdit

    •Added automatically checking for updates •Added bookmark support •Added right-click context menu for each tab •Added purging histories support •Added selecting a line by triple click •Added FreeBASIC syntax file •Added an option to place configuration files into %APPDATA% directory under Windows •Improved support for Find/Replace •Improved Mac OS X support •Improved system integration under Windows •Improved encoding detection result •Improved Hex editing support •Added more...
    Leader badge
    Downloads: 162 This Week
    Last Update:
    See Project
  • 2
    EpiDoc: Epigraphic Documents in TEI XML

    EpiDoc: Epigraphic Documents in TEI XML

    XML text markup for ancient documents

    The EpiDoc Collaborative is developing specifications and tools for standards-based, digital publication and interchange of scholarly and educational editions of documentary and literary texts like inscriptions and papyri. The link below will take you to the EpiDoc home page on this site.
    Leader badge
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Pootle, Virtaal & Translate Toolkit

    Pootle, Virtaal & Translate Toolkit

    Localization tools built by localizers for localizers

    Tools for localization: - Pootle: web based translation management system. - Virtaal: Computer Aided Translation (CAT) tool. - Translate Toolkit: QA, format conversion and support (PO, Java .properties, OpenOffice, Mozilla, XLIFF, TMX, TBX, CSV, Qt .ts).
    Leader badge
    Downloads: 64 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 5
    Voikko

    Voikko

    Library of linguistic tools

    Voikko is a spell checking, grammar checking, morphological analysis and hyphenation system. Spell checkers are available for multiple languages, other features for Finnish only.
    Leader badge
    Downloads: 19 This Week
    Last Update:
    See Project
  • 6

    Arabeyes

    A project to increase free Arabic fonts on Unix/Linux

    Arabeyes is a Meta project that is aimed at fully supporting the Arabic language in the Unix/Linux environment. It is designed to be a central location to standardize the Arabization process. Arabeyes relies on voluntary contributions.
    Leader badge
    Downloads: 217 This Week
    Last Update:
    See Project
  • 7
    Teng is a general purpose templating engine written in C++ (i.e. library). It is also available as Python module or PHP extension. The main idea of teng is to strictly separate application logic from presentation layer. Widely used on dynamic web sites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    TransHelp is designed to assist in checking the consistency of Chinese-English translations in a translation project. It is written in php and python. It is especially useful in collaborative translation projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ZML, the Zeitung Markup Language, is a simple CMS for small newspapers. It was specifically designed to publish a student newspaper in print and on the Web. It uses LaTeX and XHTML. So far, it is documented in German only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • No-Nonsense Code-to-Cloud Security for Devs | Aikido Icon
    No-Nonsense Code-to-Cloud Security for Devs | Aikido

    Connect your GitHub, GitLab, Bitbucket, or Azure DevOps account to start scanning your repos for free.

    Aikido provides a unified security platform for developers, combining 12 powerful scans like SAST, DAST, and CSPM. AI-driven AutoFix and AutoTriage streamline vulnerability management, while runtime protection blocks attacks.
    Start for Free
  • 10
    iDocs is a intellectual document work flow with text mining options project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    WikiPDF is a mediawiki extension based on Wiki2PDF that adds PDF/LaTeX features to mediawiki. Wiki2PDF is a python script to convert multiple articles of a mediawiki based wiki (pre-configured to use with www.wikipedia.org) to a single LaTeX or PDF file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Have you ever encountered dynamic web page (php, asp.net...) problems as like being browsed full of question marks? This (currently) tiny utility is for you. It tries to correct the problem by changing the file encoding to unicode.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SilverCity is a lexing package, based on Scintilla, that can provide lexical analysis for over 20 programming and markup langauges. Scripting language bindings currently exist for Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    A Python-based template and view-controller framework derived from HTML::Mason. Supports the full featureset of Mason, allowing component-based web development with Python-embedded HTML, and includes many new concepts and features not found in Mason.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    An attempt at a Xanalogical Web, comprised of a CGI system with both native and HTML interfaces, a set of standards, a backend supporting Transclusion, versioning, and a custom browser with support for Transpointing and editing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Markdown wysiwyg - A web based wysiwyg editor for John Gruber's Markdown. This allows a gentle introduction to Markdown for novices. Works with Mozilla and IE. Not yet tested with safari or konqueror.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Process OpenOffice.org Writer Files and transform them to PDF without installing OpenOffice.org What is PyOpenOffice? * It is a class library, written in the Python Language. * It is a platform-independent command-line utility (many abilitie
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Yak is a PHP implementation of the Kant Generator Pro for Python. It randomly generates grammatically correct text (read: ranting) from XML-data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    pyBlog is yet another blogging software written in python for creating custom blogs. It works in offline mode containing an ftp client for up- and downloading files to any web server. It supports multiple users (bloggers) and blogging via mail (optional)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Mamba is a extensible xml templates preprocessor wrote in Python. Using it, you can rapidly develop powerful applications ready to integrate with the internet. It can be used to work as a generic CGI program or for generate content.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    SMILGen is a SMIL (and XML) authoring tool designed to ease the process of XML content creation. SMILGen understands XML syntax and handles the nesting, formatting and syntax of XML so that an author doesn't need to remember each quote and closing brace.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Cleanux holds different scripts and programs for cleaning up messy code, such as XML and HTML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.