Showing 26 open source projects for "docx"

  • 1
    MegaParse

    File Parser optimised for LLM Ingestion with no loss

    MegaParse is a file parser optimized for Large Language Model (LLM) ingestion, ensuring no loss of information. It efficiently parses various document formats, such as PDFs, DOCX, and PPTX, converting them into formats ideal for processing by LLMs. This tool is essential for applications that require accurate and comprehensive data extraction from diverse document types.
    Downloads: 0 This Week
  • 2
    ContextGem

    ContextGem: Effortless LLM extraction from documents

    ContextGem is an open-source framework designed to simplify the extraction of structured data and insights from documents using large language models (LLMs). It provides a flexible, intuitive API that minimizes boilerplate code, enabling developers to build complex extraction workflows efficiently. ContextGem supports various document formats and integrates with multiple LLM providers, making it a versatile tool for tasks like contract analysis, anomaly detection, and information retrieval.
    Downloads: 2 This Week
  • 3
    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured and semi-structured to structured data (APIs, PDFs, documents, SQL, etc.). Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL database providers.
    Downloads: 1 This Week
  • 4
    LLMStack

    No-code multi-agent framework to build LLM Agents, workflows

    LLMStack is a no-code platform for building generative AI agents, workflows and chatbots, connecting them to your data and business processes. Build tailor-made generative AI agents, applications and chatbots that cater to your unique needs by chaining multiple LLMs. Seamlessly integrate your own data, internal tools and GPT-powered models without any coding experience using LLMStack's no-code builder. Trigger your AI chains from Slack or Discord. Deploy to the cloud or on-premise.
    Downloads: 3 This Week
  • 5
    pdf2wordx

    Convert PDF files to ".docx" documents

    `pip install pdf2wordx` This project uses Tkinter 8.6 and pdf2docx 0.5.8 to perform PDF-to-DOCX conversions. The program is easy to use: select the PDF file, browse to the folder where the DOCX document will be saved, and click the "Convertir" (Convert) button. The document is then converted and saved to the specified path.
    Downloads: 1 This Week
  • 6
    PDF-utility

    PDF Utility is a tool designed to efficiently manipulate PDF files

    Digna PDF Utility is a tool designed to efficiently manipulate PDF documents. It offers a range of functions, including adding page numbers, deleting unwanted pages, merging multiple PDFs into a single file, converting PDF to DOCX and vice versa, protecting a PDF file with a password, and displaying PDF content.
    Downloads: 4 This Week
  • 7
    bridgex

    Convert files like docx, xlsx, pptx, html, and more to Markdown

    ... Support for multiple input formats. Lightweight editing prior to saving. Supported formats: Bridgex supports conversion of PDF (.pdf), Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls, .csv), Outlook Messages (.msg), Text (.txt, .text), Markdown (.md, .markdown), JSON (.json, .jsonl), XML (.xml), RSS/Atom (.rss, .atom), HTML/MHTML (.html, .htm, .mhtml), ePub (.epub), compressed files (.zip), Jupyter Notebooks (.ipynb), and other formats supported by Markitdown. Bridgex is not an IDE, text editor, Markdown editor, or document viewer.
    Downloads: 9 This Week
  • 8

    pyLogos

    Qualitative content analysis software.

    pyLogos is a program to support text content analysis. Documents (imported from txt and docx files) are stored in a database, and text segments in them can be marked and associated with codes. These segments can be retrieved in various ways, and the program can generate word clouds and tabulate the frequency of codes and words, among other outputs. ...
    Downloads: 0 This Week
  • 9
    csv2odf

    csv2odf can convert csv data to formatted spreadsheets and documents.

    csv2odf can create business intelligence reports from csv data sources with output to ods, odt, html, xlsx, or docx documents. It uses a template file that you design to control the layout, fonts, and colors. Just query your database with output to csv (or tsv), then use csv2odf to insert the data into your template to produce a nice looking formatted output. It is a command line tool and you can automate the generation of reports by using scripts and cron.
    Downloads: 11 This Week
  • 10
    Lexifinder

    A tool to create the analytical index of a manuscript

    Lexifinder is a free and open source tool to automate the creation of an analytical index of a manuscript, based on a natural language processing model. First, convert your Docx or ODT file into a PDF. Choose the output text file, set the similarity index, and choose your desired keywords. Lexifinder will include in the index all words whose meaning resembles that of at least one keyword. The similarity index ranges from 1 to 100 and expresses the degree of resemblance required for a noun to be included.
    Downloads: 2 This Week
  • 11
    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly, free, open-source, GUI-based tool for combining, annotating, and splitting PDF files, converting PDF to Excel, PDF to Word, and images to PDF, and zipping and unzipping files. It is easy to use, supports inserting, deleting, and processing multiple files, and lets you adjust the order of files before combining.
    Downloads: 10 This Week
  • 12
    LibreOffice

    A free and powerful office suite

    ...LibreOffice makes your work look great while you focus on the content, thanks to its powerful styles system and structuring tools. LibreOffice is compatible with a wide range of document formats such as Microsoft® Word (.doc, .docx), Excel (.xls, .xlsx), PowerPoint (.ppt, .pptx) and Publisher. But LibreOffice goes much further with its native support for a modern and open standard (OpenDocument Format).
    Downloads: 933 This Week
  • 13
    pdf_docx_gen-ISA

    PDF-Docx Generator [Improved.Simplified.Alternative]

    Converts PDF files into Word files and vice versa. Compatible only with Windows.
    Downloads: 2 This Week
  • 14
    Docx2PDF The Converter [I.S,A]

    Docx-2-PDF: The Converter [Improved.Simplified.Alternative]

    'Docx-2-PDF Converter' is a desktop application developed using Python 3.11.4 and other add-on libraries. It converts Word (.docx) files into PDF files and has two modes: 1) Single file: convert one Word (.docx) file into a PDF file. 2) From Directory/Folder: convert the Word (.docx) files in a directory or folder into PDF files.
    Downloads: 1 This Week
  • 15

    openpublipostage

    Automated generation of internship agreements.

    Python software that automates the generation of internship agreements from a .doc or .docx document and an .xlsx database.
    Downloads: 0 This Week
  • 16
    Kaplan Desktop

    Free and open-source CAT tool for linguists

    A free and open-source computer-assisted translation tool built with Django/Python and Electronjs/Nodejs. For the relevant repositories, please see https://github.com/kaplanPRO . kaplanpy currently handles the following document types: .docx, .odp, .ods, .odt, .txt, .xliff (very limited coverage), and .po.
    Downloads: 0 This Week
  • 17
    PyResParser

    A simple resume parser used for extracting information from resumes

    PyResParser is a simple resume parser that extracts information from resumes, aiding in the automation of resume-processing tasks.
    Downloads: 1 This Week
  • 18

    Indexmeister

    Automatic indexing for large LaTeX documents

    Indexmeister reads a variety of formats (.tex, .docx, .epub, and others) and suggests keywords for indexing. The included program Imbrowse provides a semi-automatic interface for rapidly adding index tags to multi-file LaTeX documents.
    Downloads: 1 This Week
  • 19
    AI learning

    AiLearning, data analysis plus machine learning practice

    We actively respond to the Research Open Source Initiative (DOCX). Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem.
    Downloads: 0 This Week
  • 20
    Sistem Informasi Desa Kawasan

    Village and Regional Information System Application

    ...To log in to each service: Name: admin, Password: admin. If you need the complete source code, contact @pythonesiaorg on Twitter. This application was developed using: a. Python 2.7, b. wxPython 2.8, c. Python-Docx, d. Jinja2, e. Docxtpl, f. Openpyxl, g. Pymysql.
    Downloads: 2 This Week
  • 21
    Data Ninja

    A document clustering system with search & report generation features

    ...The report generation feature specifically for use by audit companies takes an audit report as an input and outputs an insight log and draft management letter with insights pulled from the report. This feature can be customised to suit a company's requirements. This software works with pdf, docx, txt and csv files and the zip file must be saved in "My Documents".
    Downloads: 0 This Week
  • 22

    Harmoni Search Engine Application

    Harmoni is a search engine application written in Python

    ...Harmoni is an easy-to-use application that finds keywords quickly. Harmoni has a simple interface on its “home” screen. The user can search for any string in several document types (*.txt, *.html, *.docx, *.pptx, *.xlsx, *.pdf). Two search methods are available: fragment word and whole word. Search mode: a. Fragment Word: Harmoni searches for any fragment in the target files. For example, Harmoni returns “Anda”, “Band”, “Hand”, and “Abandon” when the user types the keyword “And”. ...
    Downloads: 0 This Week
  • 23

    Harmoni Local Search Engine

    Python-Based Application

    Harmoni is a Python-based application for searching for files on your PC. It can also search for keywords inside several document formats, such as .txt, .html, .docx, .xlsx, .pptx, and .pdf. Harmoni is a fast search engine that also includes tools for multi-file deleting, renaming, moving, and so forth. It is just like Google on your computer. It is recommended for anyone who works with office documents.
    Downloads: 0 This Week
  • 24
    odf-converter-integrator is an easy way to open Microsoft Office 2007 files (also called Office Open XML: .docx, .xlsx, and .pptx) in any OpenOffice.org installation on Linux or Windows, with high-quality conversion.
    Downloads: 0 This Week
  • 25

    sort-photorec-datarecovery

    Sort PhotoRec files and pictures from a data recovery by date

    ...Useful for data recovery from HDDs, RAID arrays, or memory cards, where you get folders with mixed file types, as with PhotoRec. Supports pictures (JPG, RAW formats) and office documents (DOCX, DOC, XLSX, PDF, PPTX, and more).
    Downloads: 0 This Week
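
The Harmoni Search Engine Application entry above describes two search modes, fragment word and whole word. A minimal Python sketch of that distinction follows. It is not taken from Harmoni's source: the function names `fragment_match` and `whole_word_match` are hypothetical, and case-insensitive matching is an assumption inferred from the listing's example, where the keyword “And” matches “Band”.

```python
import re


def fragment_match(keyword, text):
    """Return every word in text that contains keyword as a substring.

    Assumption: matching is case-insensitive, as the listing's example suggests.
    """
    needle = keyword.lower()
    return [word for word in re.findall(r"\w+", text) if needle in word.lower()]


def whole_word_match(keyword, text):
    """Return only the occurrences where keyword stands alone as a word."""
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    return pattern.findall(text)


if __name__ == "__main__":
    sample = "Anda joined a Band and raised a Hand; do not Abandon it."
    print(fragment_match("And", sample))
    print(whole_word_match("And", sample))
```

With the sample sentence, fragment mode picks up every word containing “and”, while whole-word mode keeps only the standalone “and”.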