Showing 223 open source projects for "image text input"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Intervention Image

    Intervention Image

    PHP Image Processing

    Intervention Image is a PHP image handling and manipulation library. It provides an easy-to-use interface for performing common image operations such as resizing, cropping, and applying filters. It supports a variety of image formats and can be integrated into Laravel projects or used independently in other PHP applications. The library is highly customizable, allowing for simple image manipulation tasks, or more advanced image processing workflows.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    ascii from image

    ascii from image

    Literally just an image -> ascii image generator

    Converts images/video to ascii art.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Real-ESRGAN GUI

    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.
    Downloads: 119 This Week
    Last Update:
    See Project
  • 4
    Welcome to node-html-to-image

    Welcome to node-html-to-image

    A Node.js module that generates images from HTML

    ...Additionally, it embarks Handlebars to provide a way to add logic to your HTML. When you install Puppeteer, it downloads a recent version of Chromium (~170MB Mac, ~282MB Linux, ~280MB Win) that is guaranteed to work with the API. node-html-to-image takes a screenshot of the body tag's content. If you want to set output image's resolution you need to set its dimension using CSS. Handlerbars is a templating language. It generates HTML from a template and an input object. In the following example we provide a template to node-html-to-image and a content object to fill the template.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 5
    Dream Textures

    Dream Textures

    Stable Diffusion built-in to Blender

    Create textures, concept art, background assets, and more with a simple text prompt. Use the 'Seamless' option to create textures that tile perfectly with no visible seam. Texture entire scenes with 'Project Dream Texture' and depth to image. Re-style animations with the Cycles render pass. Run the models on your machine to iterate without slowdowns from a service. Create textures, concept art, and more with text prompts.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 6
    IOPaint

    IOPaint

    Image inpainting tool powered by SOTA AI Model

    ...Its feature set includes erasing people, watermarks, or defects, adding or replacing objects, applying text-aware edits, and extending images outward (outpainting) to fill contours or expand compositions.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    ChatterBot

    ChatterBot

    Machine learning, conversational dialog engine for creating chat bots

    ...Additionally, the machine-learning nature of ChatterBot allows an agent instance to improve it’s own knowledge of possible responses as it interacts with humans and other sources of informative data. An untrained instance of ChatterBot starts off with no knowledge of how to communicate. Each time a user enters a statement, the library saves the text that they entered and the text that the statement was in response to. As ChatterBot receives more input the number of responses that it can reply increase.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    Tagify

    Tagify

    Lightweight, efficient Tags input component in Vanilla JS

    Transforms an input field or a textarea into a Tags component, in an easy, customizable way, with great performance and a small code footprint, exploded with features. Customizable HTML templates for the different areas of the component (wrapper, tags, dropdown, dropdown item, dropdown header, dropdown footer) Shows suggestions list (flexible settings & styling) at full (component) width or next to the typed texted (caret) Allows setting suggestions' aliases for easier fuzzy-searching....
    Downloads: 9 This Week
    Last Update:
    See Project
  • 9
    PhotoEditor

    PhotoEditor

    A Photo Editor library with simple, easy support for image editing

    A Photo Editor library with simple, easy support for image editing using Paints, Text, Filters, Emoji and Sticker like stories. Drawing on the image with the option to change its Brush's Color, Size, Opacity, Erasing and basic shapes. Apply Filter Effect on the image using MediaEffect. Adding/Editing Text with the option to change its Color with Custom Fonts. Adding Emoji with Custom Emoji Fonts.
    Downloads: 6 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 11
    PlantUML

    PlantUML

    Generate diagrams from textual description

    ...The easiest way to test PlantUML is in an online solution that has PlantUML embedded, such as our online server. After testing, you may want to install PlantUML locally. Run (or have your software call) PlantUML, using sequenceDiagram.txt as input. The output is an image, which either appears in the other software, or is written to an image file on disk. Diagrams are defined using a simple and intuitive language. (see PlantUML Language Reference Guide). Images can be generated in PNG, in SVG or in LaTeX format. It is also possible to generate ASCII art diagrams (only for sequence diagrams).
    Downloads: 41 This Week
    Last Update:
    See Project
  • 12
    Mozc Devices

    Mozc Devices

    Circuit diagrams and firmware source code for Gboard DIY keyboards

    ...These devices creatively reinterpret how users can interact with Japanese text input, blending humor, engineering, and physical computing. The repository serves as an archive of the schematics, firmware, and PCB designs for these inventive input mechanisms, with many projects including promotional videos and technical references.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    sharp

    sharp

    High performance Node.js image processing module

    The typical use case for this high speed Node.js module is to convert large images in common formats to smaller, web-friendly JPEG, PNG, AVIF and WebP images of varying dimensions. Resizing an image is typically 4x-5x faster than using the quickest ImageMagick and GraphicsMagick settings due to its use of libvips. Colour spaces, embedded ICC profiles and alpha transparency channels are all handled correctly. Lanczos resampling ensures quality is not sacrificed for speed. As well as image resizing, operations such as rotation, extraction, compositing and gamma correction are available. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 15
    PersonaLive

    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    D2

    D2

    D2 is a modern diagram scripting language that turns text to diagrams

    D2 is a diagram scripting language that turns text to diagrams. It stands for Declarative Diagramming. Declarative, as in, you describe what you want diagrammed, it generates the image. As well, the functioning of the install script is described in detail to alleviate any concern of its use. We recommend using your OS's package manager directly instead for improved security but the install script is by no means insecure.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Laravolt Avatar

    Laravolt Avatar

    Plug n play avatar, turn name, email, and other strings into avatars

    Laravolt Avatar is a PHP library for generating customizable placeholder avatars using text initials. Often used in user profile systems, it creates square avatars with randomly assigned or custom background colors and font settings. The package is framework-agnostic but integrates especially well with Laravel projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Contour

    Contour

    Modern C++ Terminal Emulator

    ...Unicode: Emoji support (-: 🌈 💝 😛 👪 - including ZWJ, VS15, VS16 emoji :-) Unicode: Grapheme cluster support. Bold and italic fonts. High-DPI support. Vertical Line Markers (quickly jump to markers in your history!) Vi-like input modes for improved selection and copy'n'paste experience and Vi-like scrolloff feature. Blurred behind transparent background when using Windows 10 or KDE window manager on Linux. Blurrable Background image support. Runtime configuration reload. 256-color and Truecolor support. Key binding customization.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    iCn3D

    iCn3D

    Web-based protein structure viewer and analysis tool

    "I see in 3D" (iCn3D) Structure Viewer is not only a web-based 3D viewer, but also a structure analysis tool interactively or in the batch mode using NodeJS scripts based on the npm package icn3d. iCn3D synchronizes the display of 3D structure, 2D interaction, and 1D sequences and annotations. Users' custom displays can be saved in a short URL or a PNG image. The complete package of iCn3D including Three.js and jQuery is in the directory "dist" after you get the source code with the "Code" button. View a 3D structure in iCn3D, input a PDB ID, and click "Load". You can also click the "File" menu to "Open File" or input other IDs. You can open a browser in your Virtual Reality (VR) headset and view a 3D structure in iCn3D. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Pokémon Cards CSS

    Pokémon Cards CSS

    Collection of advanced CSS styles to create realistic-looking effects

    pokemon-cards-css is a CSS-driven styling framework that lets web developers render Pokémon card visuals purely in HTML and CSS. It defines layouts, frames, typography, and image placeholders to mimic the look of real Pokémon trading cards, enabling users to create “virtual cards” with custom content. Because the design is built into CSS, cards respond to responsive constraints and adjust nicely across devices. The project supports common card types (basic, stage, trainer, etc.), and includes classes to manage energy symbols, attack boxes, and flavor text. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    Fabric.js

    Fabric.js

    Javascript Canvas Library and SVG-to-Canvas Parser

    Fabric.js is a simple yet powerful Javascript HTML5 canvas library that allows you to easily work with HTML5 canvas element in various ways. It is also an SVG-to-canvas (and vice versa) parser. Fabric provides an interactive object model on top of canvas element, so you can create and populate objects on canvas; manipulate the size, position and rotation of these objects; modify properties such as color, transparency and more. You could also group these objects together with just a simple...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 23
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    ...Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB