Showing 158 open source projects for "open source png text"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    imgp

    imgp

    Multi-core image resizer and rotator. Go crunch 'em!

    imgp is a command line image resizer and rotator for JPEG and PNG images. If you have tons of images you want to resize adaptively to a screen resolution or rotate by an angle using a single command, imgp is the utility for you. It can save a lot on storage too. Powered by multiprocessing, an intelligent adaptive algorithm, recursive operations, shell completion scripts, EXIF preservation (and more), imgp is a very flexible utility with well-documented easy to use options. imgp intends...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    auto-subtitle

    auto-subtitle

    Automatically generate and overlay subtitles for any video

    auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Bulk Image Optimizer and Converter

    Bulk Image Optimizer and Converter

    Imagine having all your images well compressed and optimized :)

    Bulk Image Optimizer and Converter (Portable Executable) It allows users to choose the output format (JPEG, PNG, or WebP), set the desired image quality, and remove EXIF data. The optimized images are saved in a separate folder named "optimized" within the input folder. The tool displays progress information, including the number of images processed, the average compression ratio, and the total space saved.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    audio-diffusion-pytorch

    audio-diffusion-pytorch

    Audio generation using diffusion models, in PyTorch

    A fully featured audio diffusion library, for PyTorch. Includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based, however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here, this library is meant for research...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5
    mrViewer

    mrViewer

    Flipbook, Image Viewer and Audio-Video Player

    This project is no longer active. It has been replaced by mrv2 at: www.sourceforge.net/p/mrv2 A video player, interactive image viewer, and flipbook for use in VFX, 3D computer graphics and professional illustration.
    Leader badge
    Downloads: 79 This Week
    Last Update:
    See Project
  • 6
    Crunch PNG

    Crunch PNG

    Insane(ly slow but wicked good) PNG image optimization

    Crunch is a tool for lossy PNG image file optimization. It combines selective bit depth, color type, and color palette reduction with zopfli DEFLATE compression algorithm encoding using the pngquant and zopflipng PNG optimization tools. This approach leads to a significant file size gain relative to lossless approaches at the expense of a relatively modest decrease in image quality. Continuous benchmark testing is available in our GitHub Actions CI. Please see the benchmarks directory of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Simple BPG Image viewer

    Simple BPG Image viewer

    This program allows you to view BPG Images in Windows,Linux and macOS.

    bpgviewer =========== This program allows you to view BPG Images in Windows,Linux and macOS (format description: http://bellard.org/bpg/).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Automatic YouTube subtitle generation

    Automatic YouTube subtitle generation

    Using OpenAI's Whisper to automatically generate YouTube subtitles

    Automatic YouTube subtitle generation is a command-line tool that combines YouTube downloading capabilities with AI-powered transcription using Whisper models. It allows users to download videos or audio from YouTube and automatically generate subtitles or transcripts. The tool processes media locally, extracting audio and applying speech recognition to produce accurate text outputs. It supports multiple languages and can handle different Whisper model sizes, balancing performance and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Crunch

    Crunch

    Insane(ly slow but wicked good) PNG image optimization

    Crunch is an image compression tool for lossy PNG image file optimization. Using a combination of selective bit depth, color palette reduction and color type, as well as zopfli DEFLATE compression algorithm encoding that employs the pngquant and zopflipng PNG optimization tools, Crunch is effectively able to optimize and compress images with minimal decrease in image quality. While it may produce file size gains larger than those produced by lossless approaches, the impact on image quality...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10

    psgdump

    Dump psg/ym chip tune files to txt and midi format

    PSGDump tool is parser and converter for chip tune files. It supports PSG and YM input file formats, focusing on AY/YM chip tunes from ZX Spectrum and Atari ST. The tool produces text output of notes played and creates multi-track MIDI file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AugLy

    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    18k-youtube-download

    18k-youtube-download

    ❤️ 18k-youtube-download with python and kivy Dev.Wk-18k

    18k-youtube-download A simple project to make gui on kivy mixed with the function of downloading music from youtube with youtube_dl package
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    video-to-ascii

    video-to-ascii

    It is a simple python package to play videos in the terminal

    video-to-ascii is a Python-based command-line tool that converts and plays video files directly in the terminal using ASCII characters as visual representations. It processes each frame of a video, maps pixel brightness and color values to characters, and renders them in real time within terminal constraints. The tool adapts video resolution to match terminal dimensions, ensuring a coherent and readable output despite limited character space. It can optionally include audio playback when...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Bulk LUTs Previewer 2.8.4

    Bulk LUTs Previewer 2.8.4

    Generate a preview gallery for your LUTs.

    Quickly generate a gallery to preview your (Look Up Tables) luts on selected still frame, or video. 1) Import a still image. (jpg, jpeg, png, bmp) *) (Or select video. (mpg, mpeg, mp4, mov, mts, avi, webm, mkv)) 2) Choose path to your lutsfolder. (cube, 3dl, dat, m3d) 3) Generate. (If video selected, use scroll wheel to preview luts) All preview images that is generated is stored in '.\OUT' folder, and then deleted upon restart of the application. > [Changelog] - Version...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 16
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    srt2vobsub

    A command-line tool that converts text subtitles to vobsub

    A command line tool for Linux and other Unix-like systems that generates a pair of .idx/.sub vobsub subtitles files from textual subtitles.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    SmartBody

    Character animation system for games and simulations.

    SmartBody is available for download for Windows, Linux and OSX users. SmartBody can also be used on Android and iOS platforms. SmartBody is a character animation platform that provides the following capabilities in real time: * Locomotion (walk, jog, run, turn, strafe, jump, etc.) * Steering - avoiding obstacles and moving objects * Object manipulation - reach, grasp, touch , pick up objects * Lip Syncing - characters can speak with simultaneous lip-sync using text-to-speech or...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PyTorch Natural Language Processing

    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    ...It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. For example, check out this example code for training on the Stanford Natural Language Inference (SNLI) Corpus. Now you've setup your pipeline, you may want to ensure that some functions run deterministically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Defox text to speech and downloader

    Defox text to speech and downloader

    Written or imported text offline read or online download.

    This software design to convert text to speech and download the converted speech. Description : • Installation setup with two languages (English, French) • Two areas called text reading and speech downloading • Many languages supported to download center Note 1: I'm a student yet and I'm not in the software designing industry. Therefore maybe I haven't software making skills. I'm worried about that. ! Note 2 : When you double click on the software maybe it will get some seconds...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    oscp

    oscp

    multiplatform, small and handy audio/video player with network remote

    oscp by ariel/KotCzarny @ irc://irc.freenode.com/h3droid tiny media player: - libav (mp3, wav, ogg, aac, mpc, wma, flac, ape, avi, mkv, flv, etc.) - wildmidi (mid) - xmp (mod, med, xm, s3m, it, dbm, psm, omx, okt, digi, 669, mtm, acid, umx) - gme (ay, gbs, gym, hes, kss, nsf, nsfe, sap, spc, vgm) - sidplay2 (sid, psid, info) - mdxplay (mdx) - fc14dec (fc, fc13, fc14) - sc68 (sc68, sndh) - asapconv (sap, cmc, cm3, cmr, cms, dmc, dlt, mpt, mpd, rmt, tmc, tm8, tm2) -...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Canorus

    Canorus

    Music score editor

    Canorus is a free cross-platform music score editor. It supports an unlimited number and length of staffs, polyphony, a MIDI playback of notes, chord markings, lyrics, import/export filters to formats like MIDI, MusicXML, ABC Music, MusiXTeX and LilyPond
    Downloads: 21 This Week
    Last Update:
    See Project
  • 24

    TimingDrawer

    Text based timing diagram generator

    This tool generates timing diagrams for documenting hardware design. It reads the description from a text file with a simple syntax. It generates vector graphic (EPS, SVG or EMF format). It can be used in command line mode or with a GUI. It is written in Python and works on any platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Exposure

    Exposure

    Learning infinite-resolution image processing with GAN and RL

    Learning infinite-resolution image processing with GAN and RL from unpaired image datasets, using a differentiable photo editing model. ACM Transactions on Graphics (presented at SIGGRAPH 2018) Exposure is originally designed for RAW photos, which assumes 12+ bit color depth and linear "RGB" color space (or whatever we get after demosaicing). jpg and png images typically have only 8-bit color depth (except 16-bit pngs) and the lack of information (dynamic range/activation resolution) may...
    Downloads: 1 This Week
    Last Update:
    See Project