Showing 18 open source projects for "python pdf scaper"

View related business solutions
  • Deploy Apps in Seconds with Cloud Run Icon
    Deploy Apps in Seconds with Cloud Run

    Host and run your applications without the need to manage infrastructure. Scales up from and down to zero automatically.

    Cloud Run is the fastest way to deploy containerized apps. Push your code in Go, Python, Node.js, Java, or any language and Cloud Run builds and deploys it automatically. Get fast autoscaling, pay only when your code runs, and skip the infrastructure headaches. Two million requests free per month. And new customers get $300 in free credit.
    Try Cloud Run Free
  • Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud Icon
    Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud

    Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.

    Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
    Try Cloud SQL Free
  • 1
    ArchiveBox

    ArchiveBox

    Open source self-hosted web archiving

    ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. Without active preservation effort, everything on the internet eventually disappears or degrades. Archive.org does a great job as a centralized service, but saved URLs have to be public, and they can't save every type of content. ArchiveBox is an open source tool that lets organizations & individuals archive both public & private web content while retaining control over their data....
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    JupyterLab

    JupyterLab

    JupyterLab computational environment

    JupyterLab is the next-generation web-based user interface for Project Jupyter. Try it on Binder. JupyterLab follows the Jupyter Community Guides. JupyterLab enables you to work with documents and activities such as Jupyter notebooks, text editors, terminals, and custom components in a flexible, integrated, and extensible manner. You can arrange multiple documents and activities side by side in the work area using tabs and splitters. Documents and activities integrate with each other,...
    Downloads: 271 This Week
    Last Update:
    See Project
  • 3
    Perf Book

    Perf Book

    The book "Performance Analysis and Tuning on Modern CPU"

    This project is a practical guide to performance analysis and tuning on modern CPUs, bridging microarchitecture details with hands-on profiling. It explains how caches, TLBs, prefetchers, branch predictors, and out-of-order execution influence real program speed, then connects those concepts to concrete optimization strategies. Readers learn how to design trustworthy benchmarks, avoid measurement traps (warmup, turbo, frequency scaling), and interpret hardware performance counters. The book...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5

    littleutils

    Various small and useful command-line utilities

    The littleutils include duplicate file finders (repeats, repeats.pl, repeats.py), image optimizers (opt-jpg, opt-png, opt-gif, recomp-jpg), file rename tools (lowercase, uppercase, pren), archive recompressors (to-gzip, to-bzip2, to-bzip3, to-7zip, to-lzma, to-lzip, to-xz), a tempfile utility (tempname), file property tools (filedate, filemode, filenode, fileown, filesize, and lrealpath), and others. See the README file for more details.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 6
    garysfm

    garysfm

    An advanced file manager with qss themes and iso and folder previews

    garysfm which stands for Gary's File Manager is a file manager with some advanced features. Those features include bulk renaming and folder image previews. I has rather advanced search functions, tab browsing with persistence between launches. It remembers your folder sorting and view options in icon view. It also remembers your active tabs between sessions. It has progress dialog while doing large operations like copying large files, and folders with many files. python version works on...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 7
    DeepSearch5Plus

    DeepSearch5Plus

    Search recursively all files, text inside files, and bookmarks

    Files, text, and bookmarks html/json search tool. It is able to merge duplicated bookmark links from multiple html/json files and export to html. The user interface is designed to be used by kyeboard in any aspect of its features scrolling list, select input field, menu bar, check and drop-down menu and so on. It's able to printout search list to pdf or system printer or save as text file. It allows to rename single file or change upper/lower case, join spaces in bulk mode with multiple...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    RoshanOS

    RoshanOS

    RoshanOS is a friendly distro with welcome, layout, tweaks & tryouts

    ...It’s based on MX/Debian with customized layout & tweaks where screen corners & sides are not useless. RoshanOS contains tryouts folder on the desktop with instruction files to try Basic/C/C++/Java/Python/PHP/Perl computer programming languages out of the box. It also includes MySQL, PhpMyAdmin, XAMPP server. Following files are supported along others docx, xlsx, pptx, pdf, rtf, csv, zip, photo formats: jpg/png/gif/tiff/ico/svg/webp, video formats: avi/mov/wmv/mp4/ogg/webm. Microsoft Windows and Google Android apps can be run on RoshanOS 3 (advanced users only). ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Mininet

    Mininet

    Emulator for rapid prototyping of Software Defined Networks

    Mininet creates a realistic virtual network, running real kernel, switch and application code, on a single machine (VM, cloud or native), in seconds, with a single command. Because you can easily interact with your network using the Mininet CLI (and API), customize it, share it with others, or deploy it on real hardware, Mininet is useful for development, teaching, and research. Mininet is also a great way to develop, share, and experiment with Software-Defined Networking (SDN) systems using...
    Downloads: 23 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • 10
    sar2html
    Sar2html is web based frontend for performance monitoring. It converts sar binary data to graphical format and keep historical data in it's database. Project homepage is here: https://github.com/cemtan/sar2html.git Supported Operating Systems: HPUX 11.11, 11.23, 11,31 Solaris 5.9, 5.10, 5.11 Redhat 3, 4, 5, 6, 7 Suse 8, 9, 10, 11, 12 Ubuntu 18, 20 If you have customers facing performance problems on operating systems listed above you may send sar2ascii to collect...
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    RadicalSpam Virtual Appliance

    RadicalSpam Virtual Appliance

    Virtual Appliance of RadicalSpam

    RadicalSpam Virtual Appliance takes full solution of RadicalSpam Community Edition , pre-installed in a OVF virtual machine ( Open Virtual Format ) compatible with the best virtualization platforms on the market , including VMware ESX Server. More information : http://www.radical-spam.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    PyLogAnalyser

    A Python multiplatform tool to filter, colorise and analyse logs

    PyLogAnalyzer is a tool that receives an input log in black and white, a configuration INI file, which contains the list of rules to process the input, and an output file where to save the results. These rules permit to detect an input line according to a regular expression (regex) or line number range, filter it, pass it, colorise in foreground and background, columnise the groups of the regex and enable or disable the rule. The final goal of this tool is to ameliorate reading long and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    LinuxSA

    Linux security assessment tool.

    LinuxSA is a Python project that scans a remote or a local Linux machine for security vulnerabilities and then creates a report of the findings. The scan is read-only and no modifications are made to the target system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Open Time & Attendance

    Open Time & Attendance System (Zeiterfassung)

    Zeiterfassung für Kleine und Mittelständiges Unternehmen. Folgende Funktionen beinhaltetet diese Software: - Daten werden in eine MySQL-Datenbank gespeichert [noch nicht Vorhanden] - Daten könne über eine Softwareschnittstelle auslesen werden. [noch nicht Vorhanden] - Protokoll ausgabe per PDF. [noch nicht Vorhanden] Angedachte Funktionen - Zeiterfassung von Projekten - Urlaubsplanung - Personalplanung Hardware Zur zeit auf einen Raspberry Pi und einen RFID-Karten lesen (RMD...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Penetration-Testing-Toolkit v1.0

    A web interface for various penetration testing tools

    Penetration-Testing-Toolkit is a web based project to automate Scanning a network,Exploring CMS, Generating Undectable metasploit payload, DNS-Queries, IP related informations, Information Gathering, Domain related info etc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A command line tool that allows you to (live) analyze netfilter (iptables) log files. It provides a nice output and has features like displaying hostnames, duplicate detection... More info (+ screenshots) can be found in the pdf inside the tarball.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    savepex is a backup software writen in python. It allow you to select (also drag & drop) multiple file/directory, save your preferences in a file and recall it. savepex support command-line. All the operation are logged into a file and in a pdf file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Digital Archive Creator for OCS

    Digital Archive Creator for OCS Inventory NG

    This software is written in python and aim to add export function to OCS Inventory NG for a formatted xml with custom stylesheet in a pdf or html. DAC can export a single computer or all the computers. Web version and Cli version
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB
Gen AI apps are built with MongoDB Atlas
Atlas offers built-in vector search and global availability across 125+ regions. Start building AI apps faster, all in one place.