103 projects for "image text input" with 2 filters applied:

  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    image-blaster

    image-blaster

    An image-to-world skillset for Claude

    image-blaster is an image-to-world skillset that turns a single input image into a richer 3D production starting point. It uses Claude skills together with external generation services to create 3D environments, object meshes, Gaussian splats, and sound effects. The project is designed to accelerate early-stage 3D work by producing usable assets from visual references in just a few guided steps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Pokémon Cards CSS

    Pokémon Cards CSS

    Collection of advanced CSS styles to create realistic-looking effects

    pokemon-cards-css is a CSS-driven styling framework that lets web developers render Pokémon card visuals purely in HTML and CSS. It defines layouts, frames, typography, and image placeholders to mimic the look of real Pokémon trading cards, enabling users to create “virtual cards” with custom content. Because the design is built into CSS, cards respond to responsive constraints and adjust nicely across devices. The project supports common card types (basic, stage, trainer, etc.), and includes classes to manage energy symbols, attack boxes, and flavor text. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    ML Sharp

    ML Sharp

    Sharp Monocular View Synthesis in Less Than a Second

    ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AnimateDiff

    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. ...
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 7
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Leader badge
    Downloads: 146 This Week
    Last Update:
    See Project
  • 8
    dktools - Dirk Krauses tools

    dktools - Dirk Krauses tools

    Drawing, graphics conversion, software development, administration.

    GUI and command line tools for advanced users and administrators: wxdkdraw - Minimalistic drawing application for use with LaTeX, wxd2lat - Convert wxdkdraw files to LaTeX, bitmap2pp - Convert PNG/JPEG/TIFF/NetPBM to (E)PS or PDF, fig2lat - Convert XFig files to LaTeX, htmlbook - publish HTML like a book, dkcpre - C debugging and tracing preprocessor, itadmin - manage your IT using a MySQL/MariaDB database, dk-fic - file integrity checker, dk-ls - list files, output column order is...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    simple3d

    simple3d

    realistic driving simulation + functions for 2D/3D graphics

    ...Simcar - driving (not racing) simulation, with stunts and realistic physics since version 5.0.0. GNU/Linux and Wind0w$ executables are available. SDL_grf - functions for 2D/3D graphics (including text), sound and input + a few programs for viewing 3D models, viewing ZX Spectrum *.scr files, simulating the Solar System etc. Simple3d - old program for rendering 3D models, now included in SDL_grf but not used in the latest programs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 11
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    fileaxy

    fileaxy

    Fileaxy does file sync, de-duplication, image matching & bulk preview

    Fileaxy is a file de-duplication, organization, synchronization, and bulk previewing tool which utilizes a new user interface for local file management. Using content hashing or machine vision algorithms, Fileaxy can detect identical files as well as similar names, images, videos, or fonts and correlate those to others based on naming conventions. Optionally integrates with ImageMagick, GraphicsMagick, FFmpeg, and Mac Sips file decoding with a simple button click. Fileaxy opens NO network...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Pixelitor

    Pixelitor

    A Java image editor

    Pixelitor is a cross-platform raster graphics editor written in Java. It supports layers, layer masks, text layers, drawing, multiple undo, etc. It has more than 80 image filters and color adjustments, some of which are unique.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 14
    Motionity

    Motionity

    The web-based motion graphics editor for everyone

    ...It also supports animated text effects (fade, scale, type-writer), and can incorporate vector-based animations or Lottie animations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Perceptron

    Perceptron

    The birth of modern video feedback art.

    Perceptron is a video feedback engine with a variety of extraordinary graphical effects. Perceptron is an endless flow of transforming visuals. Perceptron * recursively transforms images and video streams in realtime and produces a combination of Julia fractals, IFS fractals, and chaotic patterns due to video feedback * evolves geometric patterns into the realm of infinite details and deepens the thought * records animations (movies) * saves and opens presets...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    ...Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. Development is happening on github at https://github.com/manisandro/gImageReader, release binaries are also posted there.
    Leader badge
    Downloads: 105 This Week
    Last Update:
    See Project
  • 17
    Marzipano

    Marzipano

    A 360° media viewer for the modern web

    Marzipano is a 360° media viewer designed for the modern web, allowing developers to display panoramic images and videos interactively with smooth performance and responsive controls. Built using HTML5, CSS3, and JavaScript, it supports multi-resolution tiling and optimized rendering to deliver efficient, high-quality experiences even with very large panoramas. The viewer can be easily embedded into web applications, offering controls for zooming, panning, and navigating between scenes....
    Downloads: 10 This Week
    Last Update:
    See Project
  • 18
    Render32

    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Primitive Pictures

    Primitive Pictures

    Reproducing images with geometric primitives

    ...Over time (e.g., adding 50-200 shapes) the output becomes an abstracted version of the input image—recognizable, artistic, and often visually striking. The tool supports input resizing, different shape modes (triangle, rectangle, circle, bezier, polygon), alpha blending, and can output PNG, JPG, SVG, or even animated GIFs that show the incremental build process. Because the Go code is self-contained and efficient, users can run it locally on images, script batch runs, or use it to generate “primitive art” or stylized thumbnails.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    BRIC

    BRIC

    BRIC is a powerful tool for batch image processing.

    Bric is a cross-platform batch image processor. You can convert, resize, rotate and add watermark to your images. Multiple file types are supported for input and output. The project started back in 2011 and was maintained for a couple of years. In 2020 BRIC is again in active development, so some of the features written below might be outdated. Please be patient, until everything is reviewed and rewritten.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Boss Engine

    8-bit Graphical API for Java

    Boss Engine is a graphical API for writing low-resolution 8-bit color applications. It offers easy to use functions for handling graphics, basic i/o and more. Screen access is performed through virtual framebuffers that allow direct read/write access to indexed color screen data. API is coded entirely with native libraries. No third party libraries required.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GIF for CLI

    GIF for CLI

    Takes in a GIF, short video, or a query to the Tenor GIF API

    gif-for-cli is a small, playful utility that brings animated GIFs to the command line by rendering frames directly in a terminal. It takes an input GIF (or a URL) and converts each frame into a terminal-friendly representation, timing updates to approximate the original animation. Depending on terminal capabilities, it can use ANSI color blocks or image protocols to achieve surprisingly faithful playback. The tool includes conveniences such as looping control, scaling to fit your terminal, and caching to avoid repeated downloads. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Java Games and Graphics is meant to be a fun project. The goal here is to have fun with JAVA graphics and JAVA games. There is no specific goal as to what kind of graphics or games.This project features an RPG Role Playing, MUD, and Turn Based Combat gam
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    FolioReaderKit

    FolioReaderKit

    A Swift ePub reader and parser framework for iOS

    ...Go to your storyboard file, choose or create the view controller that should present the epub reader. In the identity, the inspector set StoryboardFolioReaderContrainer as a class. Media Overlays (Sync text rendering with audio playback). TTS - Text to Speech Support, parse epub cover image, RTL Support. Vertical or/and Horizontal scrolling, share Custom Image Quotes NEW, supports multiple instances at same time, like parallel reading.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    VideoMan Library

    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB