Showing 4 open source projects for "docx2txt"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    DocumentGrep

    DocumentGrep

    Search text or a regular expression in multiple documents

    This is a GUI for the command line tools grep, pdfgrep, pdftotext, unrtf, odt2txt, antiword,docx2txt, html2text and libreoffice. DocumentGrep search text in multiple files types. You can use regular expressions for the search (https://en.wikipedia.org/wiki/Regular_expression). This GUI and the command line tools work without indexing. Either the document is converted into text and processed by the RegExpr libary of Andrey V. Sorokin or handeled by the cli command itself (like pdfgrep). ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 2
    Corrupt PPTX Salvager

    Corrupt PPTX Salvager

    A corruption ignoring unzipper & regular expressions extract PPTX text

    ...Corrupt PPTX Salvager is based on PPTX to Text converter by Sopan Shewale. His project is hosted on Sourceforge. Sopan's project is further based on Sandeep Kumar's docx2txt which is also found here. This program was formerly known as PPTX Recovery and Corrupt PPTX2TXT.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Corrupt DOCX Salvager

    Corrupt DOCX Salvager

    Extract text from corrupt DOCX files where Word itself fails.

    Previously known as Damaged DOCX2TXT, this GUI program will extract text from damaged/corrupted Word 2007 - 2013 DOCX format documents. DOCX files are actually zipped collections of mostly XML files. The main text in docx files is found in document.xml file in the collection. Corrupt DOCX Salvager uses 7Zip, an unzipper that sometimes unzips partially corrupt document.xml files despite reporting an error.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    docx2txt

    docx2txt

    Perl based utility to extract formatted text content from MS Docx file

    Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
    Leader badge
    Downloads: 66 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB