image text input free download

147 projects for "image text input" with 2 filters applied:

Multimedia BSD

1

image-blaster

An image-to-world skillset for Claude

image-blaster is an image-to-world skillset that turns a single input image into a richer 3D production starting point. It uses Claude skills together with external generation services to create 3D environments, object meshes, Gaussian splats, and sound effects. The project is designed to accelerate early-stage 3D work by producing usable assets from visual references in just a few guided steps.

Downloads: 0 This Week

Last Update: 1 day ago
2

Speakr

Speakr is a personal, self-hosted web application

Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.

Downloads: 1 This Week

Last Update: 2026-05-09
3

PersonaPlex

PersonaPlex code

...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.

Downloads: 2 This Week

Last Update: 2026-03-02
4

Pokémon Cards CSS

Collection of advanced CSS styles to create realistic-looking effects

pokemon-cards-css is a CSS-driven styling framework that lets web developers render Pokémon card visuals purely in HTML and CSS. It defines layouts, frames, typography, and image placeholders to mimic the look of real Pokémon trading cards, enabling users to create “virtual cards” with custom content. Because the design is built into CSS, cards respond to responsive constraints and adjust nicely across devices. The project supports common card types (basic, stage, trainer, etc.), and includes classes to manage energy symbols, attack boxes, and flavor text. ...

Downloads: 0 This Week

Last Update: 2025-12-15
5

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. ...

Downloads: 0 This Week

Last Update: 2026-01-29
6

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. ...

1 Review

Downloads: 30 This Week

Last Update: 2025-03-06
7

VietOCR

Provides optical character recognition (OCR) solutions for the Vietnamese language.

24 Reviews

Downloads: 146 This Week

Last Update: 2026-01-17
8

dktools - Dirk Krauses tools

Drawing, graphics conversion, software development, administration.

GUI and command line tools for advanced users and administrators: wxdkdraw - Minimalistic drawing application for use with LaTeX, wxd2lat - Convert wxdkdraw files to LaTeX, bitmap2pp - Convert PNG/JPEG/TIFF/NetPBM to (E)PS or PDF, fig2lat - Convert XFig files to LaTeX, htmlbook - publish HTML like a book, dkcpre - C debugging and tracing preprocessor, itadmin - manage your IT using a MySQL/MariaDB database, dk-fic - file integrity checker, dk-ls - list files, output column order is...

Downloads: 6 This Week

Last Update: 2026-04-28
9

Snowmix

Video mixer for mixing live and recorded video and audio feeds

...Control over both CLI and TCP connections. Video input and output can be done through GStreamer pipelines or the GStreamer shmsrc/shmsink API. Supported on Ubuntu, Mint, Debian, Alma, CentOS, EndeavourOS, Fedora, Mageia, Manjaro, MX Linux, OpenSUSE, RHEL, Rocky and macOS/OS X. Free support in the discussion forum. See Snowmix in action on YouTube: http://www.youtube.com/user/Snowmix4video

10 Reviews

Downloads: 7 This Week

Last Update: 5 days ago
10

simple3d

realistic driving simulation + functions for 2D/3D graphics

...Simcar - driving (not racing) simulation, with stunts and realistic physics since version 5.0.0. GNU/Linux and Windows executables are available. SDL_grf - functions for 2D/3D graphics (including text), sound and input + a few programs for viewing 3D models, viewing ZX Spectrum *.scr files, simulating the Solar System etc. Simple3d - old program for rendering 3D models, now included in SDL_grf but not used in the latest programs.

1 Review

Downloads: 1 This Week

Last Update: 2026-01-28
11

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 9 This Week

Last Update: 2026-04-22
12

JMP3Renamer

JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatic assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg

Downloads: 0 This Week

Last Update: 2024-07-14
13

fileaxy

Fileaxy does file sync, de-duplication, image matching & bulk preview

Fileaxy is a file de-duplication, organization, synchronization, and bulk previewing tool which utilizes a new user interface for local file management. Using content hashing or machine vision algorithms, Fileaxy can detect identical files as well as similar names, images, videos, or fonts and correlate those to others based on naming conventions. Optionally integrates with ImageMagick, GraphicsMagick, FFmpeg, and Mac Sips file decoding with a simple button click. Fileaxy opens NO network...

Downloads: 0 This Week

Last Update: 18 hours ago
14

ARITA

Extraordinary audio player for FreeBSD & GNU/Linux

...As for 'cuesheets': tracks are merged into a single continuous audio file and a supplementary text file, which provides information on where tracks start and end.

Downloads: 0 This Week

Last Update: 2025-09-06
15

Pixelitor

A Java image editor

Pixelitor is a cross-platform raster graphics editor written in Java. It supports layers, layer masks, text layers, drawing, multiple undo, etc. It has more than 80 image filters and color adjustments, some of which are unique.

11 Reviews

Downloads: 50 This Week

Last Update: 2023-09-06
16

AA project

AA means ASCII Art - the AAlib (ASCII art GFX library), BB (audiovisual demonstration for your terminal), aview (image browser/animation player), AAvga (SVGAlib wrapper for AA-lib), ttyquake (text mode Quake), aa3d (random dot stereogram generator)...

7 Reviews

Downloads: 242 This Week

Last Update: 2023-04-14
17

Motionity

The web-based motion graphics editor for everyone

...It also supports animated text effects (fade, scale, type-writer), and can incorporate vector-based animations or Lottie animations.

Downloads: 0 This Week

Last Update: 2025-12-02
18

Rayshade

Rayshade raytracer

Now GNU-ized (gcc-4.7.4 or gcc-10.2.0). A raytracer does not require custom code for effects such as shadows and mirrors, as GL does: it uses physics to simulate light and make realistic images, leaving one to specify only what is in the scene (at a cost of speed). Rayshade is a 1990s raytracer, a great one back then (and still useful). Rayshade has an excellent, easy-to-read yet informative User's Guide that others could not help but copy from. (html of guide is in...

Downloads: 0 This Week

Last Update: 2022-11-23
19

Cyan

Prepress image viewer and converter

Cyan is an open source cross-platform image viewer and converter, designed for prepress (print) work such as converting an image from RGB to CMYK, or the other way around. Cyan supports color profiles complying with the International Color Consortium (ICC) standard, and strives to create images that are as color-accurate as possible, with support for RGB, CMYK and GRAY with up to 32-bit image depth.

2 Reviews

Downloads: 104 This Week

Last Update: 2022-03-24
20

Perceptron

The birth of modern video feedback art.

Perceptron is a video feedback engine with a variety of extraordinary graphical effects. Perceptron is an endless flow of transforming visuals. Perceptron
* recursively transforms images and video streams in real time and produces a combination of Julia fractals, IFS fractals, and chaotic patterns due to video feedback
* evolves geometric patterns into the realm of infinite details and deepens the thought
* records animations (movies)
* saves and opens presets...

3 Reviews

Downloads: 0 This Week

Last Update: 2022-04-12
21

ExiFlow

A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without losing any data.

1 Review

Downloads: 0 This Week

Last Update: 2022-04-13
22

gImageReader

A graphical frontend to tesseract-ocr

...Features include:
- Import PDF documents and images from disk, scanning devices, clipboard and screenshots
- Process multiple images and documents in one go
- Manual or automatic recognition area definition
- Recognize to plain text or to hOCR documents
- Recognized text displayed directly next to the image
- Post-process the recognized text, including spellchecking
- Generate PDF documents from hOCR documents

**Note**: This page is only a mirror for the downloads. Development is happening on GitHub at https://github.com/manisandro/gImageReader; release binaries are also posted there.

27 Reviews

Downloads: 105 This Week

Last Update: 2022-01-28
23

Imaginary Teleprompter

Free teleprompter software

Free teleprompter software. Built with web technologies so it's easy to customize. Features include: mirroring, dual-screen support, rich text editing, image support, custom styles, and auto-save.

4 Reviews

Downloads: 256 This Week

Last Update: 2026-04-25
24

Marzipano

A 360° media viewer for the modern web

Marzipano is a 360° media viewer designed for the modern web, allowing developers to display panoramic images and videos interactively with smooth performance and responsive controls. Built using HTML5, CSS3, and JavaScript, it supports multi-resolution tiling and optimized rendering to deliver efficient, high-quality experiences even with very large panoramas. The viewer can be easily embedded into web applications, offering controls for zooming, panning, and navigating between scenes....

Downloads: 10 This Week

Last Update: 2 hours ago
25

Render32

Command-line video compositing and audio mixing tools

Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.

Downloads: 0 This Week

Last Update: 2021-02-20