Showing 28 open source projects for "image text input"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Real-ESRGAN GUI

    Real-ESRGAN GUI

    Cross-platform GUI for image upscaler Real-ESRGAN

    ...Real-ESRGAN can only enlarge the input image with a fixed 2-4x magnification (related to the selected model). This functionality is achieved by downsampling using a conventional scaling algorithm after multiple calls to Real-ESRGAN. Split each frame of the GIF and record the duration, zoom in one by one and then merge. Drag an image file or directory to any position in the window, and its path can be automatically set as the input.
    Downloads: 81 This Week
    Last Update:
    See Project
  • 2
    Windrecorder

    Windrecorder

    Windrecorder is a memory search app by records everything

    Windrecorder is an open-source personal memory search engine that continuously records on-screen activity in a highly optimized and storage-efficient format. It captures screen content locally and builds a searchable database using OCR and image understanding, allowing users to rewind and rediscover anything they have previously seen. The system indexes only meaningful visual changes, extracting text, browser data, and contextual information to improve search accuracy and reduce storage overhead. It includes a web-based interface where users can browse timelines, analyze activity, and perform semantic queries on recorded content. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Segmentation Models

    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    ...Preparing your data the same way as during weights pre-training may give you better results (higher metric score and faster convergence). It is not necessary in case you train the whole model, not only the decoder. Pytorch Image Models (a.k.a. timm) has a lot of pretrained models and interface which allows using these models as encoders in smp, however, not all models are supported. Input channels parameter allows you to create models, which process tensors with an arbitrary number of channels.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3,...
    Leader badge
    Downloads: 79 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    Snowmix

    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    ...Control over both CLI and a TCP connections. Video input and outputs can be done through GStreamer pipelines or the GStreamer shmsrc/shmsink API. Supported for Ubuntu, Mint, Debian, Alma, CentOS, EndeavourOS, Fedora, Mageia, Manjaro, MX Linux, OpenSUSE, RHEL, Rocky and macOS/OS X. Free support in the discussion forum. See Snowmix in action on Youtube http://www.youtube.com/user/Snowmix4video
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    TTGO VGA32 Lite
    NOTE: This project is marked with 'Status: Abandoned' on SourceForge because not enough time can be dedicated to this project. However it may still get sporadic commits to the repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    VSGAN

    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Imaginary Teleprompter

    Imaginary Teleprompter

    Free teleprompter software

    Free teleprompter software. Built with web technologies so its easy to customize. Features include: mirroring, dual-screen support, rich text editing, image support, custom styles, and auto-save.
    Leader badge
    Downloads: 256 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Render32

    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Consistent Depth

    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    ...The system builds upon traditional structure-from-motion (SfM) techniques to provide geometric constraints while integrating a convolutional neural network trained for single-image depth estimation. During inference, the model fine-tunes itself to align with the geometric constraints of a specific input video, ensuring stable and realistic depth maps even in less-constrained regions. This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Super-résolution via CNN

    Super-résolution via CNN

    Super resolution using a CNN, based on the work of the DGtal team

    ...This program will generate "model_epoch_ .pth" files corresponding to the model at epoch n, in a folder saved_model_u t_bs bs_tbs tbs_lr lr, where corresponds to the scale factor, bsthe size of the training batch, tbsthe size of the test batch and lrto the learning rate. Low res images should be located in a "dataset/input" folder, and high res targets in a "dataset/target" folder, where each different quality image has the same name in both folders.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Anime4kSharp

    Anime4kSharp

    Anime4K implemented in C#

    ...This time could possibly be reduced with further optimization. Images are processed in four phases that are executed on a pixel- per- pixel basis. Each phase takes a input image and renders it to a output image. This makes it easy to port the algorithm (back) to GLSL fragment shaders. As bloc97 described in his pseudo-preprint, the Anime4K algorithm is actually quite simple.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    VideoMan Library

    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    srtCreator

    creates .srt file from .txt file.

    Converts plain text (.txt) into subtitles(.srt), using punctuation to nicely cut between subs. Detects sentences with a regex. Evaluates the best character count for each subs. Tries to cut at commas, colon and semi-colons before cutting between words. in v1.2: live preview with update of max sub length
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Vianto

    Vianto

    Video Annotation Tool

    ...Vianto is a Java-based video annotation / coding tool with graphical user interface that allows you to: - Record video (in OSX only) - Save and load markers to code the video with (timestamps automatically generated for events) - Double click on events and the video will jump to the right place in the video - Click a marker to select start time, click again to set end time of code or preset a plus/minus time (in seconds) - Wildcard code to input free text - Compare multiple codings and create a consolidated set of events - Link multiple videos together to view multiple angles shot from different cameras at the same time Built with VLCj and packaged to run on Windows and OSX without the need to install VLC. Distributed freely according to the terms of the GPL license. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    QGifer

    QGifer

    A video-based animated GIF creator

    QGifer is a tool for extracting part of a video to an animated GIF file. The new demonstration video: http://youtu.be/SNTf5eNdL4Q
    Leader badge
    Downloads: 57 This Week
    Last Update:
    See Project
  • 18
    easyjasub

    easyjasub

    Tool to add furigana and in-line translation to Japanese subtitles

    ...It allows you to get subtitles with furigana and in-line translation, in several formats suitable for both video rendering and self-study Timing for the Japanese subtitles is used, the secondary subtitles may be repeated into multiple lines. The synchronization of input text subtitles is very important to properly associate them. The program is now in early development stage, you may need to read the the inline help (-h option) or browse the code in http://github.com/riccardove/easyjasub A JRE is required, even by the Windows .exe version. wkhtmltoimage is required, take it from http://sf.net/p/wkhtmltopdf BDSup2Sub may be useful (BDN/XML subtitles are created as intermediate output). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19

    Instagram Downloader

    Download images and video from Instagram

    Download any image or video from Instagram by copying the URL in the address bar into the text field and choosing a save location.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    EnKoDeur-Mixeur
    EnKoDeur-Mixeur (EKD) is an open source software which makes videos, pictures and audio post-production. It can be also used to convert videos in many formats. It is written in python and use the PyQt4 bindings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    N-Sense

    N-Sense

    N-Sense Sensory substitution software

    N-Sense is sensory substitution software. Whit goal to make sensory substitution and super senses available to everyone. N-sense uses hearing as it's input port and is(will be) capable to allow multiple senses trough it as vision, magnetic orientation, humidity, velocity,time, altitude... Almost any data from computer can be made into sense and included into project. N-Sense is modular by design, written mostly in c++ to allow optimal processing speed and uses OpenCv for Image/video processing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GOFoto is an appliaction for managing large collections of photos. It allows photo refining, generating web gallery and VideoCD.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    FRONTEND for mencoder for Windows and Linux. Convert rmvb,avi,mp4,wmv, ogv, mkv, mov, mpg, vob,ogv, ogg, ogm etc To avi (divx5), xvid, dvd iso, h264 in a simple way. Required java runtime 6 and over. (On Linux is required mencoder, dvdauthor and mkisofs)
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24
    EMC is a Entertaiment-System for playing and displaying Multimedia-Data like Audio, Video, Text, ....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Java Open Framework (JOF) is a suite of applications extending your creative potential. JOF includes a HTML creator, image editor, text editor and an audio/visual player and converter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB