Showing 14 open source projects for "pdf ocr windows"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Self-Operating Computer

    Self-Operating Computer

    A framework to enable multimodal models to operate a computer

    ...Notably, it was the first known project to implement a multimodal model capable of viewing and controlling a computer screen. The framework supports features like Optical Character Recognition (OCR) and Set-of-Mark (SoM) prompting to enhance visual grounding capabilities. It is designed to be compatible with macOS, Windows, and Linux (with X server installed), and is released under the MIT license.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Comandi Vocali Offline per Windows

    Comandi Vocali Offline per Windows

    Sistema comandi vocali offline per Windows, veloce e privato .Offline

    ... 👉 Nuova versione funzionante: https://voicecommander2multilingual.sourceforge.io/ o scaricala direttamente - direct download : https://sourceforge.net/projects/voicecommander2multilingual/files/VoiceCommander2.zip/download VoiceCommander 2.0 è stabile, migliorato e completamente operativo. Comandi Vocali Offline per Windows è un sistema di controllo vocale che funziona interamente in locale sul tuo PC. Permette di controllare il computer con la voce senza connessione internet, senza cloud e senza inviare dati all’esterno. Il sistema è progettato per garantire massima privacy, velocità e semplicità. Caratteristiche principali: - Funziona completamente offline (nessun server, nessun cloud) - Riconoscimento vocale veloce con modelli locali - Controllo di browser, programmi e sistema - Lettura dello schermo tramite OCR e sintesi vocale - Installazione semplice senza modifiche al registro - Portabile e removibile
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Open CoDesign

    Open CoDesign

    Open-source Claude Design alternative

    Open CoDesign is an open-source, desktop AI design tool that transforms natural language prompts into fully structured design artifacts such as prototypes, slide decks, and marketing assets. It is designed as a local-first alternative to cloud-based design tools, allowing users to run everything on their own machine while bringing their own AI model and API keys. The system supports multiple model providers and integrates directly with existing developer tools, enabling seamless workflows...
    Downloads: 234 This Week
    Last Update:
    See Project
  • 4
    Open Design

    Open Design

    Local-first, open-source alternative to Anthropic's Claude Design

    Open Design is a local-first, open-source AI design platform that enables coding agents to generate complete design systems and visual artifacts from prompts. It functions as an alternative to proprietary AI design tools by allowing users to connect their own models and run everything locally or deploy it as a web application. The system includes a library of design skills and brand-grade design systems that guide the generation process, ensuring consistency and quality. It integrates with...
    Downloads: 68 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    AnythingLLM

    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 96 This Week
    Last Update:
    See Project
  • 6
    Dify

    Dify

    One API for plugins and datasets, one interface for prompt engineering

    Dify is an easy-to-use LLMOps platform designed to empower more people to create sustainable, AI-native applications. With visual orchestration for various application types, Dify offers out-of-the-box, ready-to-use applications that can also serve as Backend-as-a-Service APIs. Unify your development process with one API for plugins and datasets integration, and streamline your operations using a single interface for prompt engineering, visual analytics, and continuous improvement....
    Downloads: 28 This Week
    Last Update:
    See Project
  • 7
    DB-GPT

    DB-GPT

    Revolutionizing Database Interactions with Private LLM Technology

    DB-GPT is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    5ire

    5ire

    5ire is a cross-platform desktop AI assistant, MCP client

    5ire is a sleek, cross‑platform desktop AI assistant and MCP client that connects to major service providers, supports a local knowledge base and tool integration via MCP servers, enabling robust RAG and assistant features. These components are required as they constitute the runtime environment for the MCP Server. If you don't anticipate using the tools feature immediately, you may choose to skip this installation step and complete it later when the need arises. MCP is an open protocol that...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    LLMStack

    LLMStack

    No-code multi-agent framework to build LLM Agents, workflows

    LLMStack is a no-code platform for building generative AI agents, workflows and chatbots, connecting them to your data and business processes. Build tailor-made generative AI agents, applications and chatbots that cater to your unique needs by chaining multiple LLMs. Seamlessly integrate your own data, internal tools and GPT-powered models without any coding experience using LLMStack's no-code builder. Trigger your AI chains from Slack or Discord. Deploy to the cloud or on-premise.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Your monitoring isn't a stack. It's a pile. Fix that. Icon
    Your monitoring isn't a stack. It's a pile. Fix that.

    Errors, performance, logs, uptime. One install, one invoice, one UI.

    Replace Datadog, New Relic, and Sentry without adding three more dashboards.
    Free 30 days.
  • 10
    Ai-Assistant

    Ai-Assistant

    Open-source novel writing & AI coding assistant aggregating top models

    This is an open‑source, powerful novel‑writing and AI programming assistant with the following core strengths: Model Aggregation: Natively supports the latest DeepSeek and seamlessly integrates with top‑tier models such as Gemini, Claude, GPT, Tongyi Qianwen, Kimi, and others—both domestic and international—delivering a one‑stop intelligent experience. Multimodal Capability: Accurately interprets images and PDF content, and supports invoking advanced models for high‑quality...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    LangChain Apps on Production with Jina

    LangChain Apps on Production with Jina

    Langchain Apps on Production with Jina & FastAPI

    Jina is an open-source framework for building scalable multi-modal AI apps on Production. LangChain is another open-source framework for building applications powered by LLMs. long-chain-serve helps you deploy your LangChain apps on Jina AI Cloud in a matter of seconds. You can benefit from the scalability and serverless architecture of the cloud without sacrificing the ease and convenience of local development. And if you prefer, you can also deploy your LangChain apps on your own...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    nunn

    nunn

    This is an implementation of a machine learning library in C++17

    nunn is a collection of ML algorithms and related examples written in modern C++17.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    MARS

    MARS

    Multi Agent Roundbased Simulator

    MARS (Multi Agent Roundbased Simulator) is a simulator for Multi Agent systems written in java. It sets up on the eclipse platform and is realized as a set of plugins. It was started as a project-group at University of Paderborn in 2010. At the moment there is a second project-group using MARS. The german documentation of first group can be found at http://www.cs.uni-paderborn.de/fileadmin/Informatik/AG-Kleine-Buening/files/ws11/pg-agents-2/Abschlussdoku-pg-1.pdf If you want...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    ANts P2P
    ANts P2P realizes a third generation P2P net. It protects your privacy while you are connected and makes you not trackable, hiding your identity (ip) and crypting everything you are sending/receiving from others.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo