Showing 23 open source projects for "python pdf scaper"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data....
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    JupyterLab

    JupyterLab

    JupyterLab computational environment

    JupyterLab is the next-generation web-based user interface for Project Jupyter. Try it on Binder. JupyterLab follows the Jupyter Community Guides. JupyterLab enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner. You can arrange multiple documents and activities side by side in the work area using tabs and splitters. Documents and activities integrate with each other,...
    Downloads: 271 This Week
    Last Update:
    See Project
  • 3
    Perf Book

    Perf Book

    The book "Performance Analysis and Tuning on Modern CPU"

    This project is a practical guide to performance analysis and tuning on modern CPUs, bridging microarchitecture details with hands-on profiling. It explains how caches, TLBs, prefetchers, branch predictors, and out-of-order execution influence real program speed, then connects those concepts to concrete optimization strategies. Readers learn how to design trustworthy benchmarks, avoid measurement traps (warmup, turbo, frequency scaling), and interpret hardware performance counters. The book...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Ship AI Apps Faster with Vertex AI Icon
    Ship AI Apps Faster with Vertex AI

    Go from idea to deployed AI app without managing infrastructure. Vertex AI offers one platform for the entire AI development lifecycle.

    Ship AI apps and features faster with Vertex AI—your end-to-end AI platform. Access Gemini 3 and 200+ foundation models, fine-tune for your needs, and deploy with enterprise-grade MLOps. Build chatbots, agents, or custom models. New customers get $300 in free credit.
    Try Vertex AI Free
  • 5
    UtilityHub

    UtilityHub

    UtilityHub is a lightweight, all-in-one desktop utility.

    ...User Interface • Clean & Intuitive GUI • Dark Theme for comfortable long-duration usage • Beginner-friendly design with minimal learning curve Technology Stack • Python • Tkinter (GUI) • SQLite (Local Storage) • Pillow / PDF Libraries • Packaged as a standalone Windows EXE System Requirements • Operating System: Windows 7 / 8 / 10 / 11 • No Internet Required • No External Dependencies ________________________________________ Installation 1. Download UtilityHub.exe 2. Double-click to run 3. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    File Format Renamer
    This tool only changes the file extension, meaning it changes one label for another, such as changing a .jpg to a .txt. Be careful: this is not a converter. So, if you change the label of a file from .jpg to .pdf, the file will still be an image and will not become a PDF. It can be useful for files of the same type. For example, if you have different types of images, you can make your computer see them all as if they were of the same kind.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7

    littleutils

    Various small and useful command-line utilities

    The littleutils include duplicate file finders (repeats, repeats.pl, repeats.py), image optimizers (opt-jpg, opt-png, opt-gif, recomp-jpg), file rename tools (lowercase, uppercase, pren), archive recompressors (to-gzip, to-bzip2, to-bzip3, to-7zip, to-lzma, to-lzip, to-xz), a tempfile utility (tempname), file property tools (filedate, filemode, filenode, fileown, filesize, and lrealpath), and others. See the README file for more details.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 8
    garysfm

    garysfm

    An advanced file manager with qss themes and iso and folder previews

    garysfm which stands for Gary's File Manager is a file manager with some advanced features. Those features include bulk renaming and folder image previews. I has rather advanced search functions, tab browsing with persistence between launches. It remembers your folder sorting and view options in icon view. It also remembers your active tabs between sessions. It has progress dialog while doing large operations like copying large files, and folders with many files. python version works on...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 9
    DeepSearch5Plus

    DeepSearch5Plus

    Search recursively all files, text inside files, and bookmarks

    Files, text, and bookmarks html/json search tool. It is able to merge duplicated bookmark links from multiple html/json files and export to html. The user interface is designed to be used by kyeboard in any aspect of its features scrolling list, select input field, menu bar, check and drop-down menu and so on. It's able to printout search list to pdf or system printer or save as text file. It allows to rename single file or change upper/lower case, join spaces in bulk mode with multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Deploy Apps in Seconds with Cloud Run Icon
    Deploy Apps in Seconds with Cloud Run

    Host and run your applications without the need to manage infrastructure. Scales up from and down to zero automatically.

    Cloud Run is the fastest way to deploy containerized apps. Push your code in Go, Python, Node.js, Java, or any language and Cloud Run builds and deploys it automatically. Get fast autoscaling, pay only when your code runs, and skip the infrastructure headaches. Two million requests free per month. And new customers get $300 in free credit.
    Try Cloud Run Free
  • 10
    RoshanOS

    RoshanOS

    RoshanOS is a friendly distro with welcome, layout, tweaks & tryouts

    ...It’s based on MX/Debian with customized layout & tweaks where screen corners & sides are not useless. RoshanOS contains tryouts folder on the desktop with instruction files to try Basic/C/C++/Java/Python/PHP/Perl computer programming languages out of the box. It also includes MySQL, PhpMyAdmin, XAMPP server. Following files are supported along others docx, xlsx, pptx, pdf, rtf, csv, zip, photo formats: jpg/png/gif/tiff/ico/svg/webp, video formats: avi/mov/wmv/mp4/ogg/webm. Microsoft Windows and Google Android apps can be run on RoshanOS 3 (advanced users only). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Mininet

    Mininet

    Emulator for rapid prototyping of Software Defined Networks

    Mininet creates a realistic virtual network, running real kernel, switch and application code, on a single machine (VM, cloud or native), in seconds, with a single command. Because you can easily interact with your network using the Mininet CLI (and API), customize it, share it with others, or deploy it on real hardware, Mininet is useful for development, teaching, and research. Mininet is also a great way to develop, share, and experiment with Software-Defined Networking (SDN) systems using...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 12
    sar2html
    Sar2html is web based frontend for performance monitoring. It converts sar binary data to graphical format and keep historical data in it's database. Project homepage is here: https://github.com/cemtan/sar2html.git Supported Operating Systems: HPUX 11.11, 11.23, 11,31 Solaris 5.9, 5.10, 5.11 Redhat 3, 4, 5, 6, 7 Suse 8, 9, 10, 11, 12 Ubuntu 18, 20 If you have customers facing performance problems on operating systems listed above you may send sar2ascii to collect...
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Chromensics - Google Chrome Forensics

    Chromensics - Google Chrome Forensics

    A Google chrome forensics tool

    The Chromensics tool is developed to read all information from chrome browser directory and present it to user, in easy readable tabular format which can be explored in descent interface without running the chrome browser. The tool will also allow you retrieve information from other chrome installation brought from different machine for analyzing. The acquired artifacts can be exported in PDF report to present it in court of law or to superiors. -Update 04/02/2017- Standardized all date...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    RadicalSpam Virtual Appliance

    RadicalSpam Virtual Appliance

    Virtual Appliance of RadicalSpam

    RadicalSpam Virtual Appliance takes full solution of RadicalSpam Community Edition , pre-installed in a OVF virtual machine ( Open Virtual Format ) compatible with the best virtualization platforms on the market , including VMware ESX Server. More information : http://www.radical-spam.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    PyLogAnalyser

    A Python multiplatform tool to filter, colorise and analyse logs

    PyLogAnalyzer is a tool that receives an input log in black and white, a configuration INI file, which contains the list of rules to process the input, and an output file where to save the results. These rules permit to detect an input line according to a regular expression (regex) or line number range, filter it, pass it, colorise in foreground and background, columnise the groups of the regex and enable or disable the rule. The final goal of this tool is to ameliorate reading long and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    LinuxSA

    Linux security assessment tool.

    LinuxSA is a Python project that scans a remote or a local Linux machine for security vulnerabilities and then creates a report of the findings. The scan is read-only and no modifications are made to the target system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Open Time & Attendance

    Open Time & Attendance System (Zeiterfassung)

    Zeiterfassung für Kleine und Mittelständiges Unternehmen. Folgende Funktionen beinhaltetet diese Software: - Daten werden in eine MySQL-Datenbank gespeichert [noch nicht Vorhanden] - Daten könne über eine Softwareschnittstelle auslesen werden. [noch nicht Vorhanden] - Protokoll ausgabe per PDF. [noch nicht Vorhanden] Angedachte Funktionen - Zeiterfassung von Projekten - Urlaubsplanung - Personalplanung Hardware Zur zeit auf einen Raspberry Pi und einen RFID-Karten lesen (RMD...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Penetration-Testing-Toolkit v1.0

    A web interface for various penetration testing tools

    Penetration-Testing-Toolkit is a web based project to automate Scanning a network,Exploring CMS, Generating Undectable metasploit payload, DNS-Queries, IP related informations, Information Gathering, Domain related info etc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    PyPDFTools

    PDF manipulation tools (e.g. like pdftk or pdfsam) written in python.

    This are PDF manipulation tools (e.g. like pdftk, pdfsam, ...) written in python using pyPDF and thus portable to many different OS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    MobiLinux | Linux Desktop for Android

    MobiLinux | Linux Desktop for Android

    Allowing you to use the Linux desktop everywhere you are

    MobiLinux is a mobile Linux distro based on Ubuntu 12.04 ARMv7 Build and is made to run on Android mobile phones while using little ram. This is based off http://sourceforge.net/projects/linuxonandroid/ and has been very much modified but should still be noted Requirments: - ARM based Android Mobile Phone - Android OS 2.2+ - ROOT ACCESS and BusyBox Binay - loop and ext2 kernel support - 2 GB of free space - 1 GHz CPU (dual-core or overclocked is recommended) - 512 RAM on board...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A command line tool that allows you to (live) analyze netfilter (iptables) log files. It provides a nice output and has features like displaying hostnames, duplicate detection... More info (+ screenshots) can be found in the pdf inside the tarball.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    savepex is a backup software writen in python. It allow you to select (also drag & drop) multiple file/directory, save your preferences in a file and recall it. savepex support command-line. All the operation are logged into a file and in a pdf file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Digital Archive Creator for OCS

    Digital Archive Creator for OCS Inventory NG

    This software is written in python and aim to add export function to OCS Inventory NG for a formatted xml with custom stylesheet in a pdf or html. DAC can export a single computer or all the computers. Web version and Cli version
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.