147 projects for "image text input" with 2 filters applied:

  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    image-blaster

    image-blaster

    An image-to-world skillset for Claude

    image-blaster is an image-to-world skillset that turns a single input image into a richer 3D production starting point. It uses Claude skills together with external generation services to create 3D environments, object meshes, Gaussian splats, and sound effects. The project is designed to accelerate early-stage 3D work by producing usable assets from visual references in just a few guided steps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Pokémon Cards CSS

    Pokémon Cards CSS

    Collection of advanced CSS styles to create realistic-looking effects

    pokemon-cards-css is a CSS-driven styling framework that lets web developers render Pokémon card visuals purely in HTML and CSS. It defines layouts, frames, typography, and image placeholders to mimic the look of real Pokémon trading cards, enabling users to create “virtual cards” with custom content. Because the design is built into CSS, cards respond to responsive constraints and adjust nicely across devices. The project supports common card types (basic, stage, trainer, etc.), and includes classes to manage energy symbols, attack boxes, and flavor text. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    ML Sharp

    ML Sharp

    Sharp Monocular View Synthesis in Less Than a Second

    ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AnimateDiff

    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. ...
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 7
    Provides optical character recognition (OCR) solutions for Vietnamese language.
    Leader badge
    Downloads: 146 This Week
    Last Update:
    See Project
  • 8
    dktools - Dirk Krauses tools

    dktools - Dirk Krauses tools

    Drawing, graphics conversion, software development, administration.

    GUI and command line tools for advanced users and administrators: wxdkdraw - Minimalistic drawing application for use with LaTeX, wxd2lat - Convert wxdkdraw files to LaTeX, bitmap2pp - Convert PNG/JPEG/TIFF/NetPBM to (E)PS or PDF, fig2lat - Convert XFig files to LaTeX, htmlbook - publish HTML like a book, dkcpre - C debugging and tracing preprocessor, itadmin - manage your IT using a MySQL/MariaDB database, dk-fic - file integrity checker, dk-ls - list files, output column order is...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Snowmix

    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    ...Control over both CLI and a TCP connections. Video input and outputs can be done through GStreamer pipelines or the GStreamer shmsrc/shmsink API. Supported for Ubuntu, Mint, Debian, Alma, CentOS, EndeavourOS, Fedora, Mageia, Manjaro, MX Linux, OpenSUSE, RHEL, Rocky and macOS/OS X. Free support in the discussion forum. See Snowmix in action on Youtube http://www.youtube.com/user/Snowmix4video
    Downloads: 7 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    simple3d

    simple3d

    realistic driving simulation + functions for 2D/3D graphics

    ...Simcar - driving (not racing) simulation, with stunts and realistic physics since version 5.0.0. GNU/Linux and Wind0w$ executables are available. SDL_grf - functions for 2D/3D graphics (including text), sound and input + a few programs for viewing 3D models, viewing ZX Spectrum *.scr files, simulating the Solar System etc. Simple3d - old program for rendering 3D models, now included in SDL_grf but not used in the latest programs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 12
    JMP3Renamer
    JMP3Renamer is a plugin-based renamer/tagger written in Java. It supports automatical assignment of the data to the files and magic cookies to specify the filename format. Currently available plugins: Discogs, MusicBrainz, Filename, Filetag, Mp3, Ogg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    fileaxy

    fileaxy

    Fileaxy does file sync, de-duplication, image matching & bulk preview

    Fileaxy is a file de-duplication, organization, synchronization, and bulk previewing tool which utilizes a new user interface for local file management. Using content hashing or machine vision algorithms, Fileaxy can detect identical files as well as similar names, images, videos, or fonts and correlate those to others based on naming conventions. Optionally integrates with ImageMagick, GraphicsMagick, FFmpeg, and Mac Sips file decoding with a simple button click. Fileaxy opens NO network...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ARITA

    ARITA

    Extraordinary audio player for FreeBSD & GNU/Linux

    ...As for 'cuesheets': tracks are merged into a single continuous audio file and a supplementary text file, which provides information on where tracks start and end.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Pixelitor

    Pixelitor

    A Java image editor

    Pixelitor is a cross-platform raster graphics editor written in Java. It supports layers, layer masks, text layers, drawing, multiple undo, etc. It has more than 80 image filters and color adjustments, some of which are unique.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 16
    AA means Ascii Art - the AAlib (ascii art GFX library), BB (audiovisual demonstration for your terminal), aview (image browser/animation player), AAvga (SVGAlib wrapper for AA-lib), ttyquake (text mode quake), aa3d (random dot stereogram generator)...
    Leader badge
    Downloads: 242 This Week
    Last Update:
    See Project
  • 17
    Motionity

    Motionity

    The web-based motion graphics editor for everyone

    ...It also supports animated text effects (fade, scale, type-writer), and can incorporate vector-based animations or Lottie animations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Rayshade

    Rayshade

    Rayshade raytracer

    now gnu-ized, gcc-4.7.4 or gcc-10.2.0 A raytracer does not require custom code for (shadows, mirrors) like GL: it uses physics to simulate light to make realistic images, leaving one to specify only what is in the scene. (at a cost of speed) Rayshade is a 1990's raytracer, a great one back then (and still useful). Rayshade has an excellent easy to read yet informative User's Guide that other's could not help but copy from. (html of guide is in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Cyan

    Cyan

    Prepress image viewer and converter

    Cyan is an open source cross-platform image viewer and converter, designed for prepress (print) work. Like converting an image from RGB to CMYK, or the other way around. Cyan supports color profiles complying with the International Color Consortium (ICC) standard, and strives to create as color-accurate images as possible, with support for RGB, CMYK and GRAY with up to 32-bit image depth.
    Leader badge
    Downloads: 104 This Week
    Last Update:
    See Project
  • 20
    Perceptron

    Perceptron

    The birth of modern video feedback art.

    Perceptron is a video feedback engine with a variety of extraordinary graphical effects. Perceptron is an endless flow of transforming visuals. Perceptron * recursively transforms images and video streams in realtime and produces a combination of Julia fractals, IFS fractals, and chaotic patterns due to video feedback * evolves geometric patterns into the realm of infinite details and deepens the thought * records animations (movies) * saves and opens presets...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A set of tools (command line and GUI) to provide a complete digital photo workflow for Unixes. EXIF headers are used as the central information repository, so users may change their software at any time without loosing any data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    ...Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. Development is happening on github at https://github.com/manisandro/gImageReader, release binaries are also posted there.
    Leader badge
    Downloads: 105 This Week
    Last Update:
    See Project
  • 23
    Imaginary Teleprompter

    Imaginary Teleprompter

    Free teleprompter software

    Free teleprompter software. Built with web technologies so its easy to customize. Features include: mirroring, dual-screen support, rich text editing, image support, custom styles, and auto-save.
    Leader badge
    Downloads: 256 This Week
    Last Update:
    See Project
  • 24
    Marzipano

    Marzipano

    A 360° media viewer for the modern web

    Marzipano is a 360° media viewer designed for the modern web, allowing developers to display panoramic images and videos interactively with smooth performance and responsive controls. Built using HTML5, CSS3, and JavaScript, it supports multi-resolution tiling and optimized rendering to deliver efficient, high-quality experiences even with very large panoramas. The viewer can be easily embedded into web applications, offering controls for zooming, panning, and navigating between scenes....
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    Render32

    Render32

    Command-line video compositing and audio mixing tools

    Render is a program for creating composite BMP image sequences. These images are composited as specified in a text configuration file. Mixer is a program for mixing film soundtracks. It accepts input files in WAV format and outputs a mixed soundtrack in WAV format. Each input channel can contain one or more audio files that are edited and mixed using a cue sheet. The maximum number of channels is a compile-time parameter.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB