Showing 139 open source projects for "open any file"

View related business solutions
  • Top-Rated Free CRM Software Icon
    Top-Rated Free CRM Software

    216,000+ customers in over 135 countries grow their businesses with HubSpot

    HubSpot is an AI-powered customer platform with all the software, integrations, and resources you need to connect your marketing, sales, and customer service. HubSpot's connected platform enables you to grow your business faster by focusing on what matters most: your customers.
    Get started free
  • Auth0 Free: 25K MAUs + 5-Min Setup Icon
    Auth0 Free: 25K MAUs + 5-Min Setup

    Enterprise Auth, Zero Friction: Any Framework • 30+ SDKs • Universal Login

    Production-ready login in 10 lines of code. SSO, MFA & social auth included. Scale seamlessly beyond free tier with Okta’s enterprise security.
    Get Your API Keys
  • 1
    Academic File Converter

    Academic File Converter

    Import Bibtex publications and Jupyter Notebook posts into websites

    Easily import publications and Jupyter notebooks to your Markdown-formatted website or book.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OCRmyPDF

    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 3
    Cortex Analyzers

    Cortex Analyzers

    Cortex Analyzers Repository

    Analyzers can be written in any programming language supported by Linux such as Python, Ruby, Perl, etc. Refer to the How to Write and Submit an Analyzer page for details on how to write and submit one.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Pix2Text

    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    ... formulas, and integrate all of these contents into Markdown format. P2T can also convert an entire PDF file (which can contain scanned images or any other format) into Markdown format.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Never Get Blocked Again | Enterprise Web Scraping Icon
    Never Get Blocked Again | Enterprise Web Scraping

    Enterprise-Grade Proxies • Built-in IP Rotation • 195 Countries • 20K+ Companies Trust Us

    Get unrestricted access to public web data with our ethically-sourced proxy network. Automated session management and advanced unblocking handle the hard parts. Scale from 1 to 1M requests with zero blocks. Built for developers with ready-to-use APIs, serverless functions, and complete documentation. Used by 20,000+ companies including Fortune 500s. SOC2 and GDPR compliant.
    Get Started
  • 5
    Rapid LaTeX OCR

    Rapid LaTeX OCR

    Formula recognition based on LaTeX-OCR and ONNXRuntime

    ... to LaTeX-OCR. When installing the package through pip, the model file will be automatically downloaded and placed under models in the installation directory.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    JC

    JC

    CLI tool and python library

    CLI tool and python library that converts the output of popular command-line tools and file types to JSON or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. jc JSONifies the output of many CLI tools and file types for easier parsing in scripts. This allows further command-line processing of output with tools like jq or jello by piping commands. The JC parsers can also be used as python modules. In this case, the output will be a python dictionary...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    tqdm

    tqdm

    A Fast, Extensible Progress Bar for Python and CLI

    tqdm is a fast, extensible progress bar for Python and CLI that enables you to see the progress of your loops in a clear and smart way. Simply wrap any iterable with tqdm(iterable), and sit back and watch that progress meter go! tqdm can be wrapped around any iterable, or executed as a module with pipes. Just by inserting tqdm (or python -m tqdm) between pipes will pass through all stdin to stdout while printing progress to stderr. tqdm does not require any dependencies, has a very low...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    sqlitebiter

    sqlitebiter

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook / LDJSON / LTSV / Markdown / SQLite / SSV / TSV / Google-Sheets to an SQLite database file.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    HTTPie CLI

    HTTPie CLI

    Modern, user-friendly command-line HTTP client for the API era

    HTTPie (pronounced aitch-tee-tee-pie) is a command-line HTTP client. Its goal is to make CLI interaction with web services as human-friendly as possible. HTTPie is designed for testing, debugging, and generally interacting with APIs & HTTP servers. The HTTP & HTTPS commands allow for creating and sending arbitrary HTTP requests. They use simple and natural syntax and provide formatted and colorized output.
    Downloads: 1 This Week
    Last Update:
    See Project
  • The #1 Embedded Analytics Solution for SaaS Teams. Icon
    The #1 Embedded Analytics Solution for SaaS Teams.

    Qrvey saves engineering teams time and money with a turnkey multi-tenant solution connecting your data warehouse to your SaaS application.

    Qrvey’s comprehensive embedded analytics software enables you to design more customizable analytics experiences for your end users.
    Try Developer Playground
  • 10
    Pelican

    Pelican

    Static site generator that supports Markdown and reST syntax

    .... Extensible via a rich plugin ecosystem, Pelican Plugins. Pelican considers “articles” to be chronological content, such as posts on a blog, and thus associated with a date. The idea behind “pages” is that they are usually not temporal in nature and are used for content that does not change very often (e.g., “About” or “Contact” pages). Pelican tries to be smart enough to get the information it needs from the file system.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    MathTranslate

    MathTranslate

    translate scientific papers in latex, especially arxiv papers

    This is a project to translate LaTeX documents, especially scientific papers, from any language to any language. LaTeX expressions like math expressions are perfectly kept unchanged. LaTeX documents can finally be compiled into PDF files. Especially it can be directly applied to translate arXiv papers since it provides the LaTeX source code of most of the papers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    wikmd

    wikmd

    A file based wiki that uses markdown

    It’s a file-based wiki that aims to simplicity. Instead of storing the data in a database I chose to have a file-based system. The advantage of this system is that every file is directly readable inside a terminal etc. Also when you have direct access to the system you can export the files to anything you like.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    RenderCV

    RenderCV

    LaTeX CV generator from a YAML/JSON input file

    RenderCV is a LaTeX CV/resume framework. It allows you to create a high-quality CV as a PDF from a YAML file with full Markdown syntax support and complete control over the LaTeX code. RenderCV offers built-in LaTeX and Markdown templates ready to produce high-quality CVs. However, the templates are entirely arbitrary and can easily be updated to leverage RenderCV's capabilities with your custom CV themes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    jsondiff

    jsondiff

    Diff JSON and JSON-like structures in Python

    Diff JSON and JSON-like structures in Python.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    LovelyPlots

    LovelyPlots

    Matplotlib style sheets to nicely format figures for scientific papers

    LovelyPlots is a repository containing matplotlib style sheets to nicely format figures for scientific papers, theses, and presentations while keeping them fully editable in Adobe Illustrator. Additionally, .svg export options allow figures to automatically adapt their font to your document's font. For example, .svg figures imported in a .tex file will automatically be generated with the text font used in your .tex file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Markdown package LaTeX

    Markdown package LaTeX

    Package for converting and rendering markdown documents in TeX

    The Markdown package converts CommonMark markup to TeX commands. The functionality is provided both as a Lua module, and as plain TeX, LaTeX, and ConTeXt macro packages that can be used to directly typeset TeX documents containing markdown markup. Unlike other convertors, the Markdown package does not require any external programs and makes it easy to redefine how each and every markdown element is rendered. Creative abuse of the markdown syntax is encouraged.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    mistletoe

    mistletoe

    A fast, extensible and spec-compliant Markdown parser in pure Python

    mistletoe is a Markdown parser in pure Python, designed to be fast, spec-compliant and fully customizable. Apart from being the fastest CommonMark-compliant Markdown parser implementation in pure Python, mistletoe also supports easy definitions of custom tokens. Parsing Markdown into an abstract syntax tree also allows us to swap out renderers for different output formats, without touching any of the core components.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    SPyQL

    SPyQL

    Query data on the command line with SQL-like SELECTs powered by Python

    SQL with Python in the middle. SPyQL is a query language that combines the simplicity and structure of SQL with the power and readability of Python. SPyQL offers a command-line interface that allows running SPyQL queries on top of text data (e.g. CSV, JSON). Data can come from files but also from data streams, such as as Kafka, or from databases such as PostgreSQL. Basically, data can come from any command that outputs text :-). More, data can be generated by a Python expression! And since...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    srsly

    srsly

    Modern high-performance serialization utilities for Python

    This package bundles some of the best Python serialization libraries into one standalone package, with a high-level API that makes it easy to write code that's correct across platforms and Pythons. This allows us to provide all the serialization utilities we need in a single binary wheel. Currently supports JSON, JSONL, MessagePack, Pickle, and YAML. Serialization is hard, especially across Python versions and multiple platforms. After dealing with many subtle bugs over the years (encodings,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Pandas Profiling

    Pandas Profiling

    Create HTML profiling reports from pandas DataFrame objects

    ..., separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrilic). File sizes, creation dates, dimensions, indication of truncated images and existance of EXIF metadata. Mostly global details about the dataset (number of records, number of variables, overall missigness and duplicates, memory footprint). Comprehensive and automatic list of potential data quality issues (high correlation, skewness, uniformity, zeros, missing values, constant values, between others).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    adstex

    adstex

    Automated generation of NASA ADS bibtex entries directly from citation

    adstex automatically identifies all citation keys (e.g., identifiers, author+year) in your TeX source files and uses NASA's Astrophysics Data System (ADS) to generate corresponding bibtex entries. Write your papers without worrying about the bibtex entries. Simply put down arXiv IDs, ADS bibcodes, DOIs, or first author & year citation keys in your \cite commands, and then use adstex to automatically generate the bibtex file for you.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    arxiv_latex_cleaner

    arxiv_latex_cleaner

    arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper

    This tool allows you to easily clean the LaTeX code of your paper to submit to arXiv. From a folder containing all your code, e.g. /path/to/latex/, it creates a new folder /path/to/latex_arXiv/, that is ready to ZIP and upload to arXiv.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    jello

    jello

    CLI tool to filter JSON and JSON Lines data with Python syntax

    Filter JSON and JSON Lines data with Python syntax. jello is similar to jq in that it processes JSON and JSON Lines data except jello uses standard python dict and list syntax. JSON or JSON Lines can be piped into jello via STDIN or can be loaded from a JSON file or JSON Lines files (JSON Lines are automatically slurped into a list of dictionaries). Once loaded, the data is available as a python list or dictionary object named '_'. Processed data can be output as JSON, JSON Lines, bash array...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    crossplane

    crossplane

    Quick and reliable way to convert NGINX configurations into JSON

    Reliable and fast NGINX configuration file parser and builder. Since crossplane is usually used to create payloads that are sent to different servers, it's important to keep security in mind. For that reason, the --ignore option was added. It can be used to keep certain sensitive directives out of the payload output entirely.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Bots-ediint open source edi translator

    Bots-ediint open source edi translator

    open source edi translator

    Bots-ediint is a file translator for EDI (Electronic Data Interchange). Translate files and databases data format to another one: edifact, x12, tradacoms, xml, json, raw, database (postgres, sqlite, mysql, oracle, ...) and more. Bots-ediint is based on Bots https://bots.sourceforge.net Support Python 2 and Python 3 AS2 communications are carried out with my external software that could work in standalone and/or in a Bots channel to send and receive AS2 messages (SYNC and ASYNC...
    Downloads: 0 This Week
    Last Update:
    See Project