Showing 171 open source projects for "docx"

View related business solutions
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    DOCX

    DOCX

    Generate .docx files with JS/TS with a nice declarative API

    Easily generate .docx files with JS/TS. Works for Node and on the Browser. You can mix and match whatever properties you want, or provide no properties. Setting the section type determines how the contents of the section will be placed relative to the previous section. You can specify additional properties to the section, by providing a properties attribute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Pdf to Docx
    The program converts a PDF file to docx using Word (Microsoft Office) resources in a simple and practical way. File is manipulated locally and does not go through any API. The user just needs to have Microsoft Word installed from version 2013 above.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    docx2tex

    docx2tex

    Converts Microsoft Word docx to LaTeX

    Converts Microsoft Word's DOCX to LaTeX.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    carbone

    carbone

    Fast and simple report generator, from JSON to pdf, xslx, docx, odt

    Turn your JSON into PDF, DOCX, XLSX, PPTX, ODS and many more. Fast, Simple and Powerful report generator in any format PDF, DOCX, XLSX, ODT, PPTX, ODS, XML, CSV using templates and your JSON data as input.
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    unioffice

    unioffice

    Pure go library for creating and processing Office Word documents

    unioffice is a library for creation of Office Open XML documents (.docx, .xlsx and .pptx). Its goal is to be the most compatible and highest-performance Go library for the creation and editing of docx/xlsx/pptx files. Every release of our libraries is automatically tested against known vulnerabilities and do not pass unless everything is remediated. All changes are carefully reviewed by our team. Our SDKs libraries are flexible and developed to solve common problems.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    Savvy DOCX Recovery

    Savvy DOCX Recovery

    Open corrupt Word DOCX files and possibly recover formatting too.

    XML was designed from the beginning to be intolerant of errors. This decision adversely affects MS Word's corruption recovery. With one error in the document.xml subfile where all the DOCX file's text is stored, instead of a partial recovery, Word will stop and throw an error. Savvy DOCX Recovery attempts to do precise surgery on corrupt Word documents to reorder or excise bad XML tags. If this doesn't work, it uses the command line app xmllint first to attempt to repair corrupt XML subfiles of the DOCX. If this doesn't work a second attempt is made where the corrupt XML subfiles are truncated at the first error, and the correct ending tags are again added with xmllint. ...
    Leader badge
    Downloads: 168 This Week
    Last Update:
    See Project
  • 7
    DOCX Document Converter

    DOCX Document Converter

    Convert .docx to .md/.txt and .html. Free, unlimited, fast.

    A simple, free, unlimited, secure web-based tool that converts Microsoft Word documents (.docx) into Markdown (.md/.txt) and HTML files. Perfect for developers, writers, and anyone who needs to transform .docx MS Office Word documents into web-friendly or AI context friendly formats. Unlike those other jerks on the web that charge many dollars per month for this, I made it free, unlimited and open source. This is a better version of 'convert docx to txt' since .md files can be opened in notepad++ just the same AND they preserve formatting too! ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 8
    ONLYOFFICE Desktop Editors

    ONLYOFFICE Desktop Editors

    Office suite that combines text, spreadsheet and presentation editors

    ONLYOFFICE Desktop Editors is a free and open-source office suite that combines text, spreadsheet, and presentation editors, allowing users to create, view, and edit documents on Windows, Linux, and macOS platforms.
    Downloads: 40 This Week
    Last Update:
    See Project
  • 9
    MegaParse

    MegaParse

    File Parser optimised for LLM Ingestion with no loss

    MegaParse is a file parser optimized for Large Language Model (LLM) ingestion, ensuring no loss of information. It efficiently parses various document formats, such as PDFs, DOCX, and PPTX, converting them into formats ideal for processing by LLMs. This tool is essential for applications that require accurate and comprehensive data extraction from diverse document types.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    Desktop Commander MCP

    Desktop Commander MCP

    AI-powered MCP server for desktop file and terminal automation

    ...It allows users to run terminal commands with streaming output, manage long-running processes, and even execute code in memory without saving files. It also supports working with structured and document formats such as Excel, PDF, and DOCX, enabling AI to read, modify, and generate these files directly.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11

    docx_cleanup

    free DOCX Cleanup - Embedded Font Remover - easy batch processing

    A lightweight, drag-and-drop freeware Windows utility that instantly reduces Microsoft Word (.docx) file sizes by stripping out heavy, embedded fonts without corrupting the document structure. Drag and Drop: No installation. Just drop a file or folder onto the .exe. Batch Processing: Automatically process an entire folder of .docx files (with a choice of including or excluding subfolders). 100% Safe for Word: Surgically removes the saved font files (if any) from inside the DOCX file while preserving your formatting and styles. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    Easy DataSet is a comprehensive open-source tool designed to make creating high-quality datasets for large language model fine-tuning, retrieval-augmented generation (RAG), and evaluation as easy and automated as possible by providing intuitive interfaces and powerful parsing, segmentation, and labeling tools. It supports ingesting domain-specific documents in a wide range of formats — including PDF, Markdown, DOCX, EPUB, and plain text — and can intelligently segment, clean, and structure content into rich datasets tailored for downstream LLM training needs. The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    ONLYOFFICE DocumentServer

    ONLYOFFICE DocumentServer

    ONLYOFFICE Docs is a free collaborative online office suite

    ONLYOFFICE Document Server is an open-source office suite that enables users to create, edit, and collaborate on documents, spreadsheets, and presentations in real-time via a web-based interface.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 14
    rga

    rga

    rga: ripgrep, but also search in PDFs, E-Books, Office documents, etc.

    rga is a line-oriented search tool that allows you to look for a regex in a multitude of file types. rga wraps the awesome ripgrep and enables it to search in PDF, docx, sqlite, JPG, movie subtitles (mkv, mp4), etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    PanWriter

    PanWriter

    Markdown editor with pandoc integration and paginated preview

    PanWriter is a distraction-free markdown editor with two unique features. Tight integration with pandoc for import/export to/from plenty of file formats (including HTML, docx, LaTeX and EPUB). Preview pane that can show pages – including page breaks etc. Layout adjustments are immediately reflected in the preview. Instead, you can write text in Markdown, a convention on how to write plain text files. Basically, it’s like you would write an email.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 16
    OnlyOffice Web

    OnlyOffice Web

    Perform common file preview and editing via the web

    ...It is designed with a privacy-first approach, ensuring that all document processing occurs locally in the browser, which prevents sensitive data from being uploaded or stored externally. The application supports a wide range of file formats, including DOCX, XLSX, PPTX, and CSV, making it versatile for handling common office documents. It offers a smooth and responsive editing experience, leveraging modern web technologies to replicate the functionality of traditional desktop office suites. The platform can also function as a Progressive Web App, allowing users to install it and work offline with full functionality. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    AnythingLLM

    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 142 This Week
    Last Update:
    See Project
  • 18
    LlamaParse

    LlamaParse

    Parse files for optimal RAG

    LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, and Elasticsearch and more. Deploy via Docker or Kubernetes with full data sovereignty. Build embeddable chat and search widgets, automate multi-step workflows with AI agents, and integrate via Slack, Telegram, Discord, or REST API. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PageLM

    PageLM

    PageLM is a community driven version of NotebookLM

    ...The platform includes a broad set of learning tools such as contextual chat, Cornell-style note generation, flashcards, quizzes, AI podcasts, voice transcription, homework planning, exam simulation, debate practice, and a personalized study companion. It supports uploaded documents including PDF, DOCX, Markdown, and TXT, allowing users to ground questions and generated materials in source content. On the technical side, it supports multiple model providers, multiple embedding back ends, WebSocket streaming for real-time generation, persistent content storage, and structured markdown outputs.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    PasteMD

    PasteMD

    Paste Markdown and AI responses into Word Excel instantly fast

    PasteMD is a lightweight desktop utility designed to streamline the process of transferring formatted content from the clipboard into office applications such as Word, WPS, and Excel. It primarily targets users who frequently copy content from AI chat tools or web pages and encounter formatting issues, especially with Markdown, tables, and LaTeX formulas. PasteMD operates from the system tray and monitors clipboard content, automatically converting Markdown or HTML into properly formatted...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    ContextGem

    ContextGem

    ContextGem: Effortless LLM extraction from documents

    ContextGem is an open-source framework designed to simplify the extraction of structured data and insights from documents using large language models (LLMs). It provides a flexible, intuitive API that minimizes boilerplate code, enabling developers to build complex extraction workflows efficiently. ContextGem supports various document formats and integrates with multiple LLM providers, making it a versatile tool for tasks like contract analysis, anomaly detection, and information retrieval.​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    5ire

    5ire

    5ire is a cross-platform desktop AI assistant, MCP client

    5ire is a sleek, cross‑platform desktop AI assistant and MCP client that connects to major service providers, supports a local knowledge base and tool integration via MCP servers, enabling robust RAG and assistant features. These components are required as they constitute the runtime environment for the MCP Server. If you don't anticipate using the tools feature immediately, you may choose to skip this installation step and complete it later when the need arises. MCP is an open protocol that...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 24
    myGPTReader

    myGPTReader

    AI Slack bot for reading, summarizing, and chatting with content

    myGPTReader is an AI-powered Slack bot designed to help users read, summarize, and interact with various types of digital content through conversational interfaces. It enables users to quickly understand web pages, documents, and even video content by transforming them into interactive discussions rather than static reading experiences. myGPTReader supports a wide range of file formats, including eBooks, PDFs, and text-based documents, making it flexible for both casual and professional use...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    E2M

    E2M

    E2M converts various file types (doc, docx, epub, html, htm, url

    E2M is a SourceForge mirror of the e2m open-source project, which focuses on providing tools or services designed to convert or process content between different formats or systems. Projects with similar naming conventions typically emphasize automation workflows where input data from one environment is transformed into another representation or output structure. The mirrored repository allows users to access the project’s codebase independently from its original hosting platform while...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB