Showing 148 open source projects for "file python"

  • 1
    Academic File Converter

    Import Bibtex publications and Jupyter Notebook posts into websites

    Easily import publications and Jupyter notebooks to your Markdown-formatted website or book.
    Downloads: 0 This Week
  • 2
    OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files

    OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.
    Downloads: 59 This Week
  • 3
    Docker Compose

    Define and run multi-container applications with Docker

    Docker Compose is an open source tool for defining and running multi-container applications with Docker. Compose lets you use a YAML file to configure your application’s services, and then create and start all the services from your configuration with just a single command. Compose works great in all environments: production, staging, testing, development, and on CI workflows. Compose has commands for every stage of your application lifecycle, from starting, stopping and rebuilding services...
    Downloads: 46 This Week
  • 4
    Pix2Text

    Open-Source Python3 tool for recognizing layouts, tables, and math

    An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported. Pix2Text (P2T) aims to be a free and open-source Python alternative to Mathpix, and it can already accomplish Mathpix's core functionality. Pix2Text (P2T) can recognize layouts, tables, images, text, and mathematical...
    Downloads: 12 This Week
  • 5
    CssSelector Component

    Converts CSS selectors to XPath expressions

    XPath expressions are incredibly flexible, so there is almost always an XPath expression that will find the element you need. Unfortunately, they can also become very complicated, and the learning curve is steep. Even common operations (such as finding an element with a particular class) can require long and unwieldy expressions. CSS selectors are less powerful than XPath, but far easier to write, read and understand. Since they are less powerful, almost all CSS selectors can be converted to...
    Downloads: 3 This Week
  • 6
    wikmd

    A file based wiki that uses markdown

    It’s a file-based wiki that aims for simplicity. Instead of storing the data in a database, I chose a file-based system. The advantage of this system is that every file is directly readable inside a terminal, etc. Also, when you have direct access to the system, you can export the files to anything you like.
    Downloads: 1 This Week
  • 7
    RenderCV

    LaTeX CV generator from a YAML/JSON input file

    RenderCV is a LaTeX CV/resume framework. It allows you to create a high-quality CV as a PDF from a YAML file with full Markdown syntax support and complete control over the LaTeX code. RenderCV offers built-in LaTeX and Markdown templates ready to produce high-quality CVs. However, the templates are entirely arbitrary and can easily be updated to leverage RenderCV's capabilities with your custom CV themes.
    Downloads: 1 This Week
  • 8
    sqlitebiter

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook

    A CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook / LDJSON / LTSV / Markdown / SQLite / SSV / TSV / Google-Sheets to an SQLite database file.
    Downloads: 1 This Week
  • 9
    jsondiff

    Diff JSON and JSON-like structures in Python

    Diff JSON and JSON-like structures in Python.
    Downloads: 0 This Week
  • 10
    srsly

    Modern high-performance serialization utilities for Python

    This package bundles some of the best Python serialization libraries into one standalone package, with a high-level API that makes it easy to write code that's correct across platforms and Pythons. This allows us to provide all the serialization utilities we need in a single binary wheel. Currently supports JSON, JSONL, MessagePack, Pickle, and YAML. Serialization is hard, especially across Python versions and multiple platforms. After dealing with many subtle bugs over the years (encodings...
    Downloads: 0 This Week
  • 11
    JC

    CLI tool and Python library

    CLI tool and Python library that converts the output of popular command-line tools and file types to JSON or dictionaries. This allows piping of output to tools like jq and simplifying automation scripts. jc JSONifies the output of many CLI tools and file types for easier parsing in scripts. This allows further command-line processing of output with tools like jq or jello by piping commands. The JC parsers can also be used as Python modules. In this case, the output will be a Python dictionary...
    Downloads: 0 This Week
  • 12
    StringZilla

    10x faster string search, split, sort, and shuffle for long strings

    StringZilla is the Godzilla of string libraries, splitting, sorting, and shuffling large textual datasets. StringZilla uses a heuristic so simple it's almost stupid... but it works. It matches the first few letters of words with hyper-scalar code to achieve memcpy speeds. The implementation fits into a single C99 header file and uses different SIMD flavors and SWAR on older platforms. The Str is designed to replace long Python str strings and wrap our C-level API. On the other hand, the File...
    Downloads: 0 This Week
  • 13
    jello

    CLI tool to filter JSON and JSON Lines data with Python syntax

    Filter JSON and JSON Lines data with Python syntax. jello is similar to jq in that it processes JSON and JSON Lines data, except jello uses standard Python dict and list syntax. JSON or JSON Lines can be piped into jello via STDIN or can be loaded from a JSON file or JSON Lines files (JSON Lines are automatically slurped into a list of dictionaries). Once loaded, the data is available as a Python list or dictionary object named '_'. Processed data can be output as JSON, JSON Lines, bash array...
    Downloads: 0 This Week
  • 14
    Rapid LaTeX OCR

    Formula recognition based on LaTeX-OCR and ONNXRuntime

    ... to LaTeX-OCR. When installing the package through pip, the model file will be automatically downloaded and placed under models in the installation directory.
    Downloads: 0 This Week
  • 15
    LovelyPlots

    Matplotlib style sheets to nicely format figures for scientific papers

    LovelyPlots is a repository containing matplotlib style sheets to nicely format figures for scientific papers, theses, and presentations while keeping them fully editable in Adobe Illustrator. Additionally, .svg export options allow figures to automatically adapt their font to your document's font. For example, .svg figures imported in a .tex file will automatically be generated with the text font used in your .tex file.
    Downloads: 0 This Week
  • 16
    adstex

    Automated generation of NASA ADS bibtex entries directly from citation

    adstex automatically identifies all citation keys (e.g., identifiers, author+year) in your TeX source files and uses NASA's Astrophysics Data System (ADS) to generate corresponding bibtex entries. Write your papers without worrying about the bibtex entries. Simply put down arXiv IDs, ADS bibcodes, DOIs, or first author & year citation keys in your \cite commands, and then use adstex to automatically generate the bibtex file for you.
    Downloads: 0 This Week
  • 17
    arxiv_latex_cleaner

    arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper

    This tool allows you to easily clean the LaTeX code of your paper to submit to arXiv. From a folder containing all your code, e.g. /path/to/latex/, it creates a new folder /path/to/latex_arXiv/, that is ready to ZIP and upload to arXiv.
    Downloads: 0 This Week
  • 18
    HTTPie CLI

    Modern, user-friendly command-line HTTP client for the API era

    HTTPie (pronounced aitch-tee-tee-pie) is a command-line HTTP client. Its goal is to make CLI interaction with web services as human-friendly as possible. HTTPie is designed for testing, debugging, and generally interacting with APIs & HTTP servers. The HTTP & HTTPS commands allow for creating and sending arbitrary HTTP requests. They use simple and natural syntax and provide formatted and colorized output.
    Downloads: 0 This Week
  • 19
    Pandas Profiling

    Create HTML profiling reports from pandas DataFrame objects

    ..., separator), scripts (Latin, Cyrillic) and blocks (ASCII, Cyrillic). File sizes, creation dates, dimensions, indication of truncated images and existence of EXIF metadata. Mostly global details about the dataset (number of records, number of variables, overall missingness and duplicates, memory footprint). Comprehensive and automatic list of potential data quality issues (high correlation, skewness, uniformity, zeros, missing values, constant values, among others).
    Downloads: 0 This Week
  • 20
    Pelican

    Static site generator that supports Markdown and reST syntax

    .... Extensible via a rich plugin ecosystem, Pelican Plugins. Pelican considers “articles” to be chronological content, such as posts on a blog, and thus associated with a date. The idea behind “pages” is that they are usually not temporal in nature and are used for content that does not change very often (e.g., “About” or “Contact” pages). Pelican tries to be smart enough to get the information it needs from the file system.
    Downloads: 0 This Week
  • 21
    asammdf

    Fast Python reader and editor for ASAM MDF / MF4 (Measurement Format)

    *asammdf* is a fast Python parser and editor for ASAM (Association for Standardisation of Automation and Measuring Systems) MDF / MF4 (Measurement Data Format) files. It supports MDF versions 2 (.dat), 3 (.mdf) and 4 (.mf4). *asammdf* works on Python 2.7 and Python >= 3.4.
    Downloads: 45 This Week
  • 22
    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    csv2odf can create business intelligence reports from csv data sources with output to ods, odt, html, xlsx, or docx documents. It uses a template file that you design to control the layout, fonts, and colors. Just query your database with output to csv (or tsv), then use csv2odf to insert the data into your template to produce a nice looking formatted output. It is a command line tool and you can automate the generation of reports by using scripts and cron. It can be used to create spreadsheets...
    Downloads: 6 This Week
  • 23
    NeuroJSON

    Scalable, searchable and verifiable neuroimaging data exchange platform

    Downloads: 4 This Week
  • 24
    Create Index from PDF

    PDF Indexing Script: Searches PDF for words, records page numbers

    This Python script helps automate the process of creating an index for a PDF document. It reads a list of words from a text file, searches through each page of the PDF, and records the page numbers where each word appears. The script accounts for the first 24 pages of the PDF that use Roman numerals (i-xxiv) and adjusts the page numbers accordingly. It is designed to be case-insensitive, ensuring that variations in capitalization do not affect the search results. As it processes the PDF...
    Downloads: 0 This Week
  • 25
    PoJamas aims to provide a Python library and tools for loading, processing, and producing .cr2, pz3 (crz, pzz) files compatible with the SmithMicro (e-frontier) Poser character animation application. PoJamas is composed of: a Python library; a Python Wavefront (.obj) 3D viewer based on GLFW; and a LibreOffice/Python application (to ease the library and the viewer usage). As of 2020, the project is ported to Python 3. As of 2021, this project proposes a 3D viewer for Wavefront files (pypos3dv...
    Downloads: 0 This Week