Showing 21 open source projects for "image text input"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • 1
    Real-ESRGAN GUI

    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.
    Downloads: 81 This Week
    Last Update:
    See Project
  • 2
    Windrecorder

    Windrecorder

    Windrecorder is a memory search app by records everything

    Windrecorder is an open-source personal memory search engine that continuously records on-screen activity in a highly optimized and storage-efficient format. It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage overhead. It includes a web-based interface where users can browse timelines, analyze activity, and perform semantic queries on recorded content. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Segmentation Models

    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 79 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Snowmix

    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    ...Control over both CLI and a TCP connections. Video input and outputs can be done through GStreamer pipelines or the GStreamer shmsrc/shmsink API. Supported for Ubuntu, Mint, Debian, Alma, CentOS, EndeavourOS, Fedora, Mageia, Manjaro, MX Linux, OpenSUSE, RHEL, Rocky and macOS/OS X. Free support in the discussion forum. See Snowmix in action on Youtube http://www.youtube.com/user/Snowmix4video
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    TTGO VGA32 Lite
    NOTE: This project is marked with 'Status: Abandoned' on SourceForge because not enough time can be dedicated to this project. However it may still get sporadic commits to the repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    auto-subtitle

    auto-subtitle

    Automatically generate and overlay subtitles for any video

    auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    VSGAN

    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • 10
    Imaginary Teleprompter

    Imaginary Teleprompter

    Free teleprompter software

    Free teleprompter software. Built with web technologies so its easy to customize. Features include: mirroring, dual-screen support, rich text editing, image support, custom styles, and auto-save.
    Leader badge
    Downloads: 256 This Week
    Last Update:
    See Project
  • 11
    Render32

    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Super-résolution via CNN

    Super-résolution via CNN

    Super resolution using a CNN, based on the work of the DGtal team

    ...This program will generate "model_epoch_ .pth" files corresponding to the model at epoch n, in a folder saved_model_u t_bs bs_tbs tbs_lr lr, where corresponds to the scale factor, bsthe size of the training batch, tbsthe size of the test batch and lrto the learning rate. Low res images should be located in a "dataset/input" folder, and high res targets in a "dataset/target" folder, where each different quality image has the same name in both folders.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Anime4kSharp

    Anime4kSharp

    Anime4K implemented in C#

    ...This time could possibly be reduced with further optimization. Images are processed in four phases that are executed on a pixel- per- pixel basis. Each phase takes a input image and renders it to a output image. This makes it easy to port the algorithm (back) to GLSL fragment shaders. As bloc97 described in his pseudo-preprint, the Anime4K algorithm is actually quite simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    VideoMan Library

    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    srtCreator

    creates .srt file from .txt file.

    Converts plain text (.txt) into subtitles(.srt), using punctuation to nicely cut between subs. Detects sentences with a regex. Evaluates the best character count for each subs. Tries to cut at commas, colon and semi-colons before cutting between words. in v1.2: live preview with update of max sub length
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Vianto

    Vianto

    Video Annotation Tool

    ...Vianto is a Java-based video annotation / coding tool with graphical user interface that allows you to: - Record video (in OSX only) - Save and load markers to code the video with (timestamps automatically generated for events) - Double click on events and the video will jump to the right place in the video - Click a marker to select start time, click again to set end time of code or preset a plus/minus time (in seconds) - Wildcard code to input free text - Compare multiple codings and create a consolidated set of events - Link multiple videos together to view multiple angles shot from different cameras at the same time Built with VLCj and packaged to run on Windows and OSX without the need to install VLC. Distributed freely according to the terms of the GPL license. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    easyjasub

    easyjasub

    Tool to add furigana and in-line translation to Japanese subtitles

    ...It allows you to get subtitles with furigana and in-line translation, in several formats suitable for both video rendering and self-study Timing for the Japanese subtitles is used, the secondary subtitles may be repeated into multiple lines. The synchronization of input text subtitles is very important to properly associate them. The program is now in early development stage, you may need to read the the inline help (-h option) or browse the code in http://github.com/riccardove/easyjasub A JRE is required, even by the Windows .exe version. wkhtmltoimage is required, take it from http://sf.net/p/wkhtmltopdf BDSup2Sub may be useful (BDN/XML subtitles are created as intermediate output). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    Instagram Downloader

    Download images and video from Instagram

    Download any image or video from Instagram by copying the URL in the address bar into the text field and choosing a save location.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Java Open Framework (JOF) is a suite of applications extending your creative potential. JOF includes a HTML creator, image editor, text editor and an audio/visual player and converter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This software generates animated image slide shows using GIF Files. It offers suppert to more than 255 colors. As a input image you can use jpeg, gif, png and all other image files supported by the Java Runtime.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Amiant Navigator is a cross-platform plug-in based all-in-one file manager, media content viewer/editor/converter, archiver, text editor, ftp browser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB