image text input free download

Showing 34 open source projects for "image text input"

View related business solutions

Graphics Python Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
1

IOPaint

Image inpainting tool powered by SOTA AI Model

...Its feature set includes erasing people, watermarks, or defects, adding or replacing objects, applying text-aware edits, and extending images outward (outpainting) to fill contours or expand compositions.

Downloads: 20 This Week

Last Update: 2026-02-03
See Project
2

Dream Textures

Stable Diffusion built-in to Blender

Create textures, concept art, background assets, and more with a simple text prompt. Use the 'Seamless' option to create textures that tile perfectly with no visible seam. Texture entire scenes with 'Project Dream Texture' and depth to image. Re-style animations with the Cycles render pass. Run the models on your machine to iterate without slowdowns from a service. Create textures, concept art, and more with text prompts.

Downloads: 2 This Week

Last Update: 2024-08-26
See Project
3

PersonaLive

Expressive Portrait Image Animation for Live Streaming

PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. ...

Downloads: 3 This Week

Last Update: 5 days ago
See Project
4

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. ...

Downloads: 0 This Week

Last Update: 2026-01-29
See Project
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.

Try for Free
5

Mesh R-CNN

code for Mesh R-CNN, ICCV 2019

...Unlike voxel-based or point-based approaches, Mesh R-CNN uses a differentiable mesh representation, allowing it to efficiently refine surface geometry while maintaining high spatial detail. The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
6

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. ...

1 Review

Downloads: 30 This Week

Last Update: 2025-03-06
See Project
7

stmani3

Stereo Photo Manipulation

A set of programs for Alignment and Rendering of still Stereo Photos (3D). This is a Python3 updated version of the old StMani

Downloads: 0 This Week

Last Update: 2024-07-13
See Project
8

DALL-E 2 - Pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. ...

Downloads: 0 This Week

Last Update: 2023-10-19
See Project
9

Stable Diffusion in Docker

Run the Stable Diffusion releases in a Docker container

...Create an image from an existing image and a text prompt. Modify an existing image with its depth map and a text prompt.

Downloads: 0 This Week

Last Update: 2023-09-22
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

PicResize

A simple pic resizer

A simple pic resizer working with drag and drop. Drag and drop an image file on a shortcut to the program, input width or height, confirm, find your resized image in the same folder with new dimensions in the file name.

Downloads: 1 This Week

Last Update: 2023-12-09
See Project
11

Bulk Image Optimizer and Converter

Imagine having all your images well compressed and optimized :)

Bulk Image Optimizer and Converter (Portable Executable) It allows users to choose the output format (JPEG, PNG, or WebP), set the desired image quality, and remove EXIF data. The optimized images are saved in a separate folder named "optimized" within the input folder. The tool displays progress information, including the number of images processed, the average compression ratio, and the total space saved.

Downloads: 0 This Week

Last Update: 2023-05-03
See Project
12

ExiFlow

A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without loosing any data.

1 Review

Downloads: 0 This Week

Last Update: 2022-04-13
See Project
13

TRACER

Extreme Attention Guided Salient Object Tracing Network

Extreme Attention Guided Salient Object Tracing Network (AAAI 2022) implementation in PyTorch. Now, fast inference mode offers a salient object result with the mask. You can get the more clear salient object by tuning the threshold. We will release initializing TRACER with a version of pre-trained TE-x.

Downloads: 0 This Week

Last Update: 2023-04-05
See Project
14

PythonStarSplitter

A Python Script I made to split a starfield image into several layers.

A Python Script I made to split a starfield image into several layers. To be able to use the script, PixInsight with an installed Gaia data catalogue is required, as it needs the exported astrometry data text file.

Downloads: 0 This Week

Last Update: 2021-12-21
See Project
15

3DDFA

Fast, accurate and stable 3D dense face alignment

...A simple 3D render written by c++ and cython is also included. This repo supports the onnxruntime, and the latency of regressing 3DMM parameters using the default backbone is about 1.35ms/image on CPU with a single image as input. See requirements.txt, tested on macOS and Linux platforms. The Windows users may refer to FQA for building issues. Note that this repo uses Python3. The major dependencies are PyTorch, numpy, opencv-python and onnxruntime, etc.

Downloads: 2 This Week

Last Update: 2022-02-21
See Project
16

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial : https://www.youtube.com/playlist?...

5 Reviews

Downloads: 10 This Week

Last Update: 2020-10-19
See Project
17

GIF for CLI

Takes in a GIF, short video, or a query to the Tenor GIF API

gif-for-cli is a small, playful utility that brings animated GIFs to the command line by rendering frames directly in a terminal. It takes an input GIF (or a URL) and converts each frame into a terminal-friendly representation, timing updates to approximate the original animation. Depending on terminal capabilities, it can use ANSI color blocks or image protocols to achieve surprisingly faithful playback. The tool includes conveniences such as looping control, scaling to fit your terminal, and caching to avoid repeated downloads. ...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
18

TimingDrawer

Text based timing diagram generator

This tool generates timing diagrams for documenting hardware design. It reads the description from a text file with a simple syntax. It generates vector graphic (EPS, SVG or EMF format). It can be used in command line mode or with a GUI. It is written in Python and works on any platform.

Downloads: 0 This Week

Last Update: 2019-01-25
See Project
19

Windows Spotlight Slideshow Update

App maintains a slideshow image folder using Windows Spotlight images.

This app maintains a slideshow image folder, using Windows Spotlight images that meet the customizable selection criteria. The app adds or deletes slideshow images based on additions to or deletions from the Windows Spotlight folder by Windows Spotlight. The slideshow folder can be specified in Windows Settings background personalization as the input to a desktop/background slideshow.

Downloads: 0 This Week

Last Update: 2017-09-21
See Project
20

Training Image Operators from Samples

Tools to train Image Operators automatically from a set of samples.

TRIOS - Training Image Operators from Samples is a set of tools to bring Image Processing closer to scientists in general. It is capable of estimating an operator between two images using only pairs of samples that contain an input image and the desired output. The operator is saved to a file and can be applied to any image.

Downloads: 0 This Week

Last Update: 2017-07-31
See Project
21

PDF Report Generator from image maps.

GimpPy uses img maps & an img as the input, output is a report.py file used to generate PDFs, the out files may run solo or chained together to make more complex multi page reports. Input required is a dict with vals for flds you have mapped on your img.

Downloads: 0 This Week

Last Update: 2015-05-18
See Project
22

Newspaper3k

News, full-text, and article metadata extraction in Python 3

Inspired by requests for its simplicity and powered by lxml for its speed. Newspaper is an amazing python library for extracting & curating articles. Newspaper delivers Instapaper style article extraction. Newspaper is a Python3 library! If you are certain that an entire news source is in one language, go ahead and use the same api. Works in 10+ languages, English, Chinese, German, Arabic, and more! On python3 you must install newspaper3k, not newspaper. newspaper is our python2 library....

Downloads: 0 This Week

Last Update: 2021-05-26
See Project
23

Open Asset Import Library

Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and a lot of other formats. The data is stored in an own in-memory data-format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also Open Source.

24 Reviews

Downloads: 31 This Week

Last Update: 2014-06-21
See Project
24

scrimage

A unique python-based image editor

A unique python-based image editor with low-level control. It will be able to apply fairly complex mathematical operations to individual pixels based on the contents of a script or user input at a command line. It will then be able to apply those changes to the image for a unique effect.

Downloads: 0 This Week

Last Update: 2015-10-13
See Project
25

EnKoDeur-Mixeur

EnKoDeur-Mixeur (EKD) is an open source software which makes videos, pictures and audio post-production. It can be also used to convert videos in many formats. It is written in python and use the PyQt4 bindings.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-30
See Project