Showing 106 open source projects for "python web crawler"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    Pytago

    Pytago

    A source-to-source transpiler for Python to Go translation

    pytago is a source-to-source transpiler that converts some Python into human-readable Go. It enables developers to translate Python codebases into Go, facilitating migration or interoperability between the two languages. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    I Heart LA

    I Heart LA

    Compilable markdown for linear algebra

    I Heart LA is a compilable markdown for math. It can generate working code in your favorite language (C++, Python, MATLAB, more to come) and LaTeX from snippets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Just Another Desktop Environment

    Just Another Desktop Environment

    Linux desktop environment built with HTML5, CSS, JavaScript and Python

    Desktop Environment built with Web Technologies, JDE takes over your desktop to manage applications a dock or panel is still needed to complement it. Clean and minimalistic interface. Settings panel. Show/Hide application categories. Keyboard application search. Visual application search. Settings panel integrates with individual application settings. Dbus integration. UI inspector. Animated Backgrounds. Drag and Drop, optional Window auto tile. Desktop Tour on first run. Scriptable workspaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    CBMPy

    CBMPy

    PySCeS Constraint Based Modelling

    PySCeS CBMPy is a new platform for constraint based modelling and analysis. It has been designed using principles developed in the PySCeS simulation software project: usability, flexibility and accessibility. CBMPy supports the latest standards for encoding CBM models encoding, SBML L3 FBC, COBRA as well as MIRIAM compliant RDF and custom annotations. Its architecture is both extensible and flexible using data structures that are intuitive to the biologist while transparently...
    Downloads: 0 This Week
    Last Update:
    See Project
  • eProcurement Software Icon
    eProcurement Software

    Enterprises and companies seeking a solution to manage all their procurement operations and processes

    eBuyerAssist by Eyvo is a cloud-based procurement solution designed for businesses of all sizes and industries. Fully modular and scalable, it streamlines the entire procurement lifecycle—from requisition to fulfillment. The platform includes powerful tools for strategic sourcing, supplier management, warehouse operations, and contract oversight. Additional modules cover purchase orders, approval workflows, inventory and asset management, customer orders, budget control, cost accounting, invoice matching, vendor credit checks, and risk analysis. eBuyerAssist centralizes all procurement functions into a single, easy-to-use system—improving visibility, control, and efficiency across your organization. Whether you're aiming to reduce costs, enhance compliance, or align procurement with broader business goals, eBuyerAssist helps you get there faster, smarter, and with measurable results.
    Learn More
  • 5
    Budou

    Budou

    Budou is an auto organizer tool for beautiful line breaking in CJK

    Budou is a Python library developed by Google to improve web typography for CJK (Chinese, Japanese, Korean) languages by producing semantically meaningful line breaks. Unlike English, CJK scripts lack spaces or hyphenation cues, often resulting in awkward or unreadable text wrapping on web pages. Budou addresses this issue by segmenting sentences into logical lexical chunks and wrapping each chunk in non-breaking HTML <span> tags.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Full Stack FastAPI Couchbase

    Full Stack FastAPI Couchbase

    Full stack, modern web application generator

    Full stack, modern web application generator. Using FastAPI, Couchbase as a database, Docker, automatic HTTPS, and more. Couchbase has a great set of features that is not easily or commonly found in alternatives. REST backend tests based on Pytest, integrated with Docker, so you can test the full API interaction, independent on the database. As it runs in Docker, it can build a new data store from scratch each time (so you can use ElasticSearch, MongoDB, or whatever you want, and just test...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Bots open source edi translator

    Bots open source edi translator

    Bots is a complete translator for edi: edifact, x12, xml, tradacoms

    Bots is a complete translator for edi (Electronic Data Interchange). EDI data formats eg: edifact, x12, tradacoms, xml. Mail: http://groups.google.com/group/botsmail Web-site: http://bots.sourceforge.net Wiki: http://bots.readthedocs.io Develop: https://github.com/eppye-bots/bots
    Downloads: 21 This Week
    Last Update:
    See Project
  • 8
    pyspider

    pyspider

    A powerful Spider(Web Crawler) system in Python

    pyspider is a powerful Spider(Web Crawler) system in Python. Components are connected by message queue. Every component, including message queue, is running in their own process/thread, and replaceable. That means, when process is slow, you can have many instances of processor and make full use of multiple CPUs, or deploy to multiple machines. This architecture makes pyspider really fast. benchmarking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-powered conversation intelligence software Icon
    AI-powered conversation intelligence software

    Unlock call analytics that provide actionable insights with our call tracking software, empowering you to identify what's working and what's not.

    Every customer interaction is vital to your business success and revenue growth. With Jiminny’s AI-powered conversation intelligence software, we take recording, capturing, and meticulous analysis of call recordings to the next level. Unlock call analytics that provide actionable insights with our call tracking software, empowering you to identify what's working and what's not. Seamlessly support your biggest objectives across the entire business landscape with our innovative call tracking system.
    Learn More
  • 10
    Extended Memory Semantics (EMS)

    Extended Memory Semantics (EMS)

    Persistent shared object memory and parallelism for Node.js and Python

    EMS makes possible persistent shared memory parallelism between Node.js, Python, and C/C++. Extended Memory Semantics (EMS) unifies synchronization and storage primitives to address several challenges of parallel programming. A modern multi-core server has 16-32 cores and nearly 1TB of memory, equivalent to an entire rack of systems from a few years ago. As a consequence, jobs formerly requiring a Map-Reduce cluster can now be performed entirely in shared memory on a single server without...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PyXB (“pixbee”) is a pure Python package that generates Python source code for classes that correspond to data structures defined by XMLSchema. In concept it is similar to JAXB for Java and CodeSynthesis XSD for C++.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    Note: latest version can be found at https://github.com/targeted/pythomnic3k Pythomnic3k is a Python 3 framework for service-oriented middleware with hot reloading and fault tolerance. It is used for integrating various systems in enterprise network or writing standalone network services.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    htmlarea

    htmlarea

    Small, powerful, full featured WYSIWYG editor

    HTMLArea 4 is a browser based WYSIWYG editor that easily replaces the TEXTAREA in your web pages. It is written in JavaScript, and suitable for use in any modern web browser, and any page on your web site. Current version is 4.0-2016-08-29
    Downloads: 10 This Week
    Last Update:
    See Project
  • 14
    Cloud Export is a tool to automatically extract your data from web applications and save it to your local file system for backup purposes, but more extensive than Google Takeout. Plans are based on http://www.dataliberation.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DoCookBook

    DoCookBook

    Cookbook Style Document for DocBook Customizations

    This project has been moved to GitHub: https://github.com/tomschr/dbcookbook/ The DoCookBook project aims to create an open source book about DocBook and the DocBook XSL stylesheets written as a cookbook and released under a Creative Commons license.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    GreasySpoon

    GreasySpoon

    An ICAP Services Factory

    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    acp245

    ACP245 Suite: Brazil Telematics protocol implementation

    This "suite" includes a portable reference implementation and testing tools for ACP245, the automotive Telematics protocol defined by Brazil government. see: http://www.denatran.gov.br/simrav/simrav.asp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    SPARQL Endpoint interface to Python

    This project has been moved to http://rdflib.github.io/sparqlwrapper/

    A library to allow query a SPARQL end-point in Python
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    blog4y

    blog4y

    Easy blogging for you.

    Project goal is to allow you easy post and backup blog articles to most free blogging hosting. Provide Python API, command line utility, Emacs integration and GTK interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Framework (scripts, configuration, code) to build free and public services around travel and leisure data. That project makes an extensive use of already existing data sources such as Geonames and dbPedia, and adds some glue around those (eg, links).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    python library with utility classes for: - access mysql via - nevow / form - mangaing form and new field for form - building pdf report with reportlab
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Wiko, the wiki compiler, compiles wiki like files into html and LaTeX, combining easy wiki syntax, your preferred non-web text editor and svn/cvs control to write static webs, cientific articles or even blogs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Aurora Application Server is a new Python Web Application Server and Framework. The main goal of the project is to provide the developer with a complete set of tools to speed up the application development process. See project wiki for more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    JS SmartM3 KP API

    javascript API for SmartM3

    Porting of the SmartM3 Triple Space's KP on JavaScript. Due to limitations on JS connectivity a "WebSocket to TCP" relay has been developed in order to enable JavaScript KP to communicato with a SmartM3 SIB. Relay is based on jWebSocketServer. User manual currently only in italian :(
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Where In the World Have You Been?
    A PHP script with maps of the World, China, Canada, USA, India, Africa and Europe that allows the user to select the countries, provinces or states by clicking on them or selecting a checkboxes. Selection causes the entity to turn a default color which contrasts with defaults colors of all bordering countries. Thus a patchwork is made to show the history of countries, states or provinces traveled. Added features allow users to download their maps, to blow them up to posters of any...
    Downloads: 0 This Week
    Last Update:
    See Project