Showing 812 open source projects for "image to text"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    EasyRPG Player

    EasyRPG Player

    RPG Maker 2000/2003 and EasyRPG games interpreter

    ... support (+ HarfBuzz for Unicode text shaping) mpg123 for better MP3 audio support. WildMIDI for better MIDI audio support. Libvorbis / Tremor for Ogg Vorbis audio support. opusfile for Opus audio support. libsndfile for better WAVE audio support. libxmp for better tracker music support. SpeexDSP for proper audio resampling.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    PromptAppGPT

    PromptAppGPT

    A rapid prompt app development framework based on GPT

    PromptAppGPT is a low-code prompt-based rapid app development framework. PromptAppGPT contains features such as low-code prompt-based development, GPT text generation, DALLE image generation, online prompt editer+compiler+runer, automatic user interface generation, support for plug-in extensions, etc. PromptAppGPT aims to enable natural language app development based on GPT.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    WindowTextExtractor

    WindowTextExtractor

    WindowTextExtractor allows you to get a text from any OS

    ...) for Windows 10, 11. Launch WindowTextExtractor on your system. Now simply drag the 'Find window' button to get a window information, text, password or image.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    HTTP Shortcuts for Android

    HTTP Shortcuts for Android

    Android app to create home screen shortcuts

    A simple Android app that allows you to create shortcuts that can be placed on your home screen. Each shortcut, when clicked, triggers an HTTP request, with the possibility to process and display the response in various ways. Run arbitrary JavaScript before & after execution, which allows for lots of customization with features such as compute values such as timestamps, random numbers, UUIDs, hashes, HMACs, base64, etc. Parse JSON or XML and extract data from them. Show toast message or...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Manage printing in a cost-efficient and eco-friendly way with Gelato. Icon
    Manage printing in a cost-efficient and eco-friendly way with Gelato.

    Gelato offers an extensive catalog of custom products, a zero-inventory business model, and free designing tools—all in one place.

    The world's largest print on demand network with 140+ production partners across 32 countries. Gelato offers end-to-end design, production and logistics for individuals looking to start their own business today!
    Sign up for Free
  • 5
    LinkChecker

    LinkChecker

    Check links in web documents or full websites

    LinkChecker is a free, GPL licensed website validator. LinkChecker checks links in web documents or full websites. It runs on Python 3 systems, requiring Python 3.8 or later. The version in the pip repository may be old, to find out how to get the latest code, plus platform-specific information and other advice see doc/install.txt in the source code archive. If you do not want to install any additional libraries/dependencies you can use the Docker image which is published on GitHub Packages.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    DevToys

    DevToys

    A Swiss Army knife for developers

    DevToys helps in daily tasks like formatting JSON, comparing text, testing RegExp. No need to use many untruthful websites to do simple tasks with your data. With Smart Detection, DevToys is able to detect the best tool that can treat the data you copied in the clipboard of your Windows. Compact overlay lets you keep the app in small and on top of other windows. Multiple instances of the app can be used at once. DevToys works entirely offline, meaning that none of the data used by the app goes...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    OpenFlamingo

    OpenFlamingo

    An open-source framework for training large multimodal models

    Welcome to our open source version of DeepMind's Flamingo model! In this repository, we provide a PyTorch implementation for training and evaluating OpenFlamingo models. We also provide an initial OpenFlamingo 9B model trained on a new Multimodal C4 dataset (coming soon). Please refer to our blog post for more details. This repo is still under development, and we hope to release better-performing and larger OpenFlamingo models soon. If you have any questions, please feel free to open an...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Node.js Client For NLP Cloud

    Node.js Client For NLP Cloud

    NLP Cloud serves high performance pre-trained or custom models

    This is the Node.js client (with Typescript types) for the NLP Cloud API. NLP Cloud serves high-performance pre-trained or custom models for NER, sentiment analysis, classification, summarization, dialogue summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, blog post generation, text generation, question answering, automatic speech recognition...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    OpenAI Web Application

    OpenAI Web Application

    A web application that allows users to interact with OpenAI's models

    A web application that allows users to interact with OpenAI's modles through a simple and user-friendly interface. This app is for demo purpose to test OpenAI API and may contain issues/bugs. User-friendly interface for making requests to the OpenAI API. Responses are displayed in a chat-like format. Select Models (Davinci, Codex, DALL·E, Whisper) based on your needs. Create AI Images (DALL·E). Audio-Text Transcribe (Whisper). Highlight code syntax. Type in the input field and press enter...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Accounting Software for Small Businesses | Xero Icon
    Accounting Software for Small Businesses | Xero

    Save 90% for 6 months on Xero's award-winning accounting and online bookkeeping platform for businesses of all sizes and stages of growth.

    Xero offers a robust ecosystem of connected apps and integrations with banks and financial institutions, enabling small businesses to access a wide range of solutions within Xero's open platform to streamline operations and manage finances. Additionally, accounting and bookkeeping firms benefit from efficient compliance tools, advanced practice management software, and a cloud-based unified accounting ledger for all clients, centralized in one place.
    Get 90% off for 6 months
  • 10
    DocArray

    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Onefetch

    Onefetch

    Git repository summary on your terminal

    Onefetch is a command-line Git information tool written in Rust that displays project information and code statistics for a local Git repository directly on your terminal. The tool is completely offline, no network access is required. By default, the repo's information is displayed alongside the dominant language's logo, but you can further configure onefetch to instead use an image, on supported terminals, text input, or nothing at all. It automatically detects open source licenses from texts...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Stable Diffusion in Docker

    Stable Diffusion in Docker

    Run the Stable Diffusion releases in a Docker container

    ... a suitable GPU you can set the options --device cpu and --onnx instead. Since it uses the model, you will need to create a user access token in your Huggingface account. Save the user access token in a file called token.txt and make sure it is available when building the container. Create an image from an existing image and a text prompt. Modify an existing image with its depth map and a text prompt.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ChatGPT-Telegram-Workers

    ChatGPT-Telegram-Workers

    Deploy your own Telegram ChatGPT bot on Cloudflare Workers with ease

    The simplest and fastest way to deploy your own ChatGPT Telegram bot. Use Cloudflare Workers, single file, copy and paste directly, no dependencies required, no need to configure local development environment, no domain name required, serverless. You can customize the system initialization information so that your debugged personality never disappears.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Extract TOTP/HOTP secrets

    Extract TOTP/HOTP secrets

    Extract one time password (OTP) secrets from QR codes

    The Python script extract_otp_secrets.py extracts one-time password (OTP) secrets from QR codes exported by two-factor authentication (2FA) apps such as "Google Authenticator".
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Unstructured.IO

    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. The use cases of unstructured revolve around streamlining and optimizing the data processing workflow for LLMs. unstructured modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and is efficient in transforming unstructured data...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Spectacle

    Spectacle

    A ReactJS based Presentation Library

    Spectacle is an open source presentation library based on React.js. It lets you create sleek presentations using JSX syntax, giving you the ability to live demo your code. Spectacle’s presentations are lively and interactive, with clickable elements, themes, custom backgrounds, animations, and many other fun and unique elements you can add to make your presentations pop. With Spectacle you can also live demo your code in real time without having to leave the presentation deck. Code blocks...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    DeepDetect

    DeepDetect

    Deep Learning API and Server in C++14 support for Caffe, PyTorch

    ... of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) Fast Server written in pure C++, a single codebase for Cloud, Desktop & Embedded.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Feishu ChatGPT

    Feishu ChatGPT

    Voice dialogue, role-playing, multi-topic discussion, picture creation

    Feishu × (GPT-3.5 + DALL·E + Whisper) = flying-like work experience. Voice dialogue, role-playing, multi-topic discussion, picture creation, table analysis, document export. Golang language, it goes without saying! Master the gin framework proficiently, developing the backend is as natural as breathing! Familiar with the SDKs of DingTalk, Feishu, Qiwei and other platforms, and be able to develop and integrate a series of amazing functions! Proficient in platform-based detail thinking, let...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Vimium C

    Vimium C

    A keyboard shortcut browser extension for keyboard-based navigation

    ... page). And it can map the same key sequence to different commands for different websites (and/or for different active elements on the page). It can copy any selected text and current tab's title and URL to the system clipboard, and read the clipboard to search the copied text using a specified search engine. It can also enable/disable websites' image loading and even JavaScript execution, if you trigger its command "toggleCS".
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Venom

    Venom

    Venom is the most complete javascript library for Whatsapp

    Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on artificial intelligence and all types of design architecture for WhatsApp. It's a high-performance alternative API to whatzapp, you can send, text messages, files, images, videos and more. Remember, the API was developed on a platform called RESTful Web services, providing interoperability between...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    Elastiknn

    Elastiknn

    Elasticsearch plugin for nearest neighbor search

    Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity searches using exact and approximate algorithms. Methods like word2vec and convolutional neural nets can convert many data modalities (text, images, users, items, etc.) into numerical vectors, such that pairwise distance computations on the vectors correspond to semantic similarity of the original data. Elasticsearch is a ubiquitous search solution, but its support for vectors is limited. This plugin fills...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    dejavu

    dejavu

    The missing web UI for Elasticsearch

    ..., as a chrome extension or as a docker image. Starting v1.0, dejavu is the only Elasticsearch web UI that supports importing data via JSON and CSV files, as well as defining field mappings from the GUI. Starting with v3.0, we support the ability to connect to multiple indexes. You can also globally search across your indexes using global search bar.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    Nextra

    Nextra

    Simple, powerful and flexible site generation framework

    Simple, powerful, and flexible site generation framework with everything you love from Next.js. Nextra automatically converts Markdown links and images to use Next.js Link and Next.js Image when possible. No slow navigation or layout shift.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    ZLPhotoBrowser

    ZLPhotoBrowser

    Wechat-like image picker. Support select photos, videos, gif, etc.

    ZLPhotoBrowser is a Wechat-like image picker. Support select normal photos, videos, gif, and live photos. Support edit images and crop video. Image editor (Draw/Crop/Image sticker/Text sticker/Mosaic/Filter/Adjust(Brightness, Contrast, and Saturation)), (Draw color can be customized; Crop ratio can be customized; Filter effect can be customized; You can choose the editing tool you want). Customize the maximum number of previews or selection, the maximum and minimum optional duration...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Aphantasia

    Aphantasia

    CLIP + FFT/DWT/RGB = text to image/video

    This is a collection of text-to-image tools, evolved from the artwork of the same name. Based on CLIP model and Lucent library, with FFT/DWT/RGB parameterizes (no-GAN generation). Illustrip (text-to-video with motion and depth) is added. DWT (wavelets) parameterization is added. Check also colabs below, with VQGAN and SIREN+FFM generators. Tested on Python 3.7 with PyTorch 1.7.1 or 1.8. Generating massive detailed textures, a la deepdream, fullHD/4K resolutions and above, various CLIP models...
    Downloads: 0 This Week
    Last Update:
    See Project