Showing 29 open source projects for "java screen capture"

  • 1
    comfyui-mixlab-nodes

    Workflow and speech recognition app

    ...It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that mix live screen content, generative models, and visual effects. For audio and speech, it provides nodes for SpeechRecognition and SpeechSynthesis, plus workflows that combine voice generation with real-time face swapping and other audio-visual effects. On the AI side, it integrates multiple LLM providers (cloud and local), supports OpenAI-compatible endpoints, Siliconflow models, and includes prompt-focused utilities for random prompt generation, Chinese prompts, clip interrogation.
    Downloads: 5 This Week
  • 2
    Pot Desktop

    A cross-platform software for text translation and recognition

    Pot-Desktop is a cross-platform productivity tool aimed at helping users quickly translate, perform OCR (optical character recognition), and synthesize speech for selected text or images — all with minimal friction. It supports picking text via mouse selection (“highlight-and-translate”), clipboard listening, or screenshot-based OCR; this makes it ideal for reading webpages, documents, images — or any on-screen text — and instantly getting translations or text extraction. The tool supports...
    Downloads: 21 This Week
  • 3
    Live API Web Console

    A react-based starter app for using the Live API over websockets

    Live API Web Console is a React starter that demonstrates how to use Gemini’s Live API over WebSockets to build real-time, multimodal experiences. The app includes modules for streaming audio playback, recording user media from the microphone, webcam, or even screen capture, and it surfaces a unified event log so you can debug the session as it flows. Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. ...
    Downloads: 0 This Week
  • 4
    JavaCV

    Java interface to OpenCV, FFmpeg, and more

    JavaCV uses wrappers from the JavaCPP Presets of commonly used libraries by researchers in the field of computer vision (OpenCV, FFmpeg, libdc1394, FlyCapture, Spinnaker, OpenKinect, librealsense, CL PS3 Eye Driver, videoInput, ARToolKitPlus, flandmark, Leptonica, and Tesseract) and provides utility classes to make their functionality easier to use on the Java platform, including Android. JavaCV also comes with hardware accelerated full-screen image display (CanvasFrame and GLCanvasFrame), easy-to-use methods to execute code in parallel on multiple cores (Parallel), user-friendly geometric and color calibration of cameras and projectors (GeometricCalibrator, ProCamGeometricCalibrator, ProCamColorCalibrator), detection and matching of feature points (ObjectFinder), a set of classes that implement direct image alignment of projector-camera systems (mainly GNImageAligner, ProjectiveTransformer, ProjectiveColorTransformer, ProCamTransformer, and ReflectanceInitializer), and more.
    Downloads: 12 This Week
  • 5
    JimuReport

    Open source drag-and-drop reporting and dashboard builder platform

    ...JimuReport also includes components for building interactive charts, data tables, and analytical displays that can be used in enterprise applications. It can connect to multiple data sources and retrieve data through SQL queries, APIs, or other structured formats. It can be embedded into Java applications using Spring Boot integration modules.
    Downloads: 5 This Week
  • 6
    eSearch

    screen recognition and search

    Downloads: 22 This Week
  • 7
    Deep Chat

    Customizable AI chat component for websites with API support

    Deep Chat is a highly customizable web component designed to simplify the integration of AI-powered chat interfaces into websites. It allows developers to embed a fully functional chatbot using minimal setup, while still offering extensive control over behavior, appearance, and integrations. Deep Chat supports connections to a wide range of AI services as well as custom backends, enabling flexible deployment for different use cases. It is built as a framework-agnostic solution, meaning it...
    Downloads: 2 This Week
  • 8
    ViZDoom

    Doom-based AI research platform for reinforcement learning

    ViZDoom allows developing AI bots that play Doom using only the visual information (the screen buffer). It is primarily intended for research in machine visual learning, and deep reinforcement learning, in particular. ViZDoom is based on ZDOOM, the most popular modern source-port of DOOM. This means compatibility with a huge range of tools and resources that can be used to create custom scenarios, availability of detailed documentation of the engine and tools and support of Doom community....
    Downloads: 0 This Week
  • 9
    Provides optical character recognition (OCR) solutions for the Vietnamese language.
    Downloads: 163 This Week
  • 10
    Robocode

    Robocode is a programming tank game for Java

    Robocode is a programming game where the goal is to develop a robot battle tank in Java to battle against other tanks. The robot battles run in real time and on screen. The motto of Robocode is: Build the best, destroy the rest!
    Downloads: 417 This Week
  • 11
    OculiX

    Visual Automation IDE — automate anything you see on screen

    OculiX is the evolution of SikuliX, actively maintained with the full agreement of its original creator RaiMan. Automate any desktop application using image recognition (OpenCV) and OCR (Tesseract + PaddleOCR). No access to source code or DOM required — if you can see it, you can automate it. Key features: - Guided step-by-step recorder with live code preview - Image recognition via OpenCV 4.10 - Dual OCR: Tesseract (built-in) + PaddleOCR (neural, high precision) - Local and remote...
    Downloads: 159 This Week
  • 12

    Image To Text tools

    ITTT is a free tool designed to scan and extract text from images.

    Image To Text Tools is a 100% free, user-friendly tool designed to scan images and extract the text they contain into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 29 This Week
  • 13
    Dual Clip Translator
    Translation of selected text or clipboard contents, powered by Google. Hotkeys paste or change text with automatic translation. The translation result is shown in a balloon or window and is also sent to the clipboard. Screen capture of desktop/game > OCR > translation.
    Downloads: 10 This Week
  • 14
    html2canvas

    A JavaScript HTML screenshot renderer

    html2canvas is a JavaScript HTML renderer. The script provides you with the tools to take screenshots of webpages directly on the browser. The screenshot is based on the DOM and therefore, it may not be 100% accurate to the real representation, given that it is not an actual screenshot, but a type of screenshot built based on the available data and information of the page. The script renders such page as a canvas image, by reading the DOM and the different styles of the featured elements. It...
    Downloads: 13 This Week
  • 15
    Next Generation Programming

    Compose Software Without Writing Any Programming Code

    "Next Generation Programming - Programming Without Coding Software" is a drag-and-drop wizard for creating simple or complex applications without writing any programming language code. The software is written in Java for novice and expert programmers. Programmers can write software with visual tools: drag-and-drop components, visual editors... Programmers can use the software to compose simple or complex applications: database programs, circuit design, generate...
    Downloads: 3 This Week
  • 16
    Douyin-Bot

    Python TikTok bot

    ...Its original goal was playful and experimental, focused on browsing and identifying content based on computer vision and face analysis. It is best understood as a demonstration of app automation, screen capture, API-based image analysis, and scripted mobile interaction. Its main value is showing how Python can orchestrate a phone app workflow by combining ADB, screenshots, recognition logic, and automated gestures.
    Downloads: 0 This Week
  • 17
    BossSensor

    Hide screen when boss is approaching

    BossSensor is an experimental open-source application that uses computer vision and machine learning to detect when a specific person, such as a supervisor or manager, approaches a computer workstation. The project uses a webcam to continuously capture images and analyze them using a face classification model trained to distinguish between the designated “boss” and other individuals. When the system detects that the trained face appears in the camera view, the program automatically triggers actions such as hiding the user’s screen or switching to a safe display. The software relies on libraries such as OpenCV, TensorFlow, and Python-based machine learning tools to perform face detection and classification. ...
    Downloads: 0 This Week
  • 18
    Panzer Combat II

    Computer-assisted miniature tank game.

    Panzer Combat II is a multi-player voice and webcam enabled computer-assisted distributed miniature wargame of World War II tank combat. Firing is done by placing a webcam behind the aiming unit. Distance to target is computed using computer vision. Action inside the tanks is performed on the computer screen while battlefield strategy is played on the miniature terrain. Both camps can use a different laptop or tablet, the game will interconnect. You can try it online :...
    Downloads: 6 This Week
  • 19

    smartblob

    tiny code, html webcam game or plug brain in each blob, java server

    ...I'm planning a huge multiplayer world, with some blobs played by people holding a bendable loop game controller (tape a 1 meter cut of thick extension cord into a loop) in front of a webcam and bending it to bend their blob on screen, and other blobs controlled by AI. This is a game for general AI research in a fun way people can do together. The physics and vision algorithm is half working in version 0.3.0. The gameplay is better experienced in 0.2, which is controlled with the mouse, so you don't have as much freedom of movement as with the webcam. This small file contains its own source code including occamserver, a tiny general Java server I built, which I'll adjust to allow HTTP streaming connections to stay open for faster AJAX than a new web call each time. ...
    Downloads: 0 This Week
  • 20

    physicsmata

    API for all possible cellular automata that work same at all angles

    ...This could be used as a game interface for evolvable musical instruments or fluid puzzle games. Physicsmata is similar in effect to SmoothLife but simpler and pure Java. The cellular automata API takes a function to run at each point. Its parameters are sums of screen brightness (n color dimensions are optional) at concentric circles around each point, efficiently costing only the perimeter of those circles (instead of the area) because of the tree of squares. You can create any function, or to get started try the visual function editor where each click changes if a column is sine/sigmoid/half/double/exp/etc or which of the 0, 1, or 2 columns to the left are its params. ...
    Downloads: 0 This Week
  • 21
    ACT-CV provides an easy-to-use interface between visual objects on the computer screen (or another video source) and a cognitive modeling tool.
    Downloads: 0 This Week
  • 22

    Chordalysis

    Log-linear analysis (data modelling) for high-dimensional data

    ...Associated papers at ICDM 2013, ICDM 2014 and SDM 2015 can be found at http://www.francois-petitjean.com/Research/. YourKit is supporting the Chordalysis open source project with its full-featured Java Profiler. YourKit is the creator of innovative and intelligent tools for profiling Java and .NET applications. http://www.yourkit.com
    Downloads: 0 This Week
  • 23

    lavalamp>3

    screen colors get changed by a neural net

    On the full screen or in a frame, the colors are changed dynamically by an image recognition neural net.
    Downloads: 0 This Week
  • 24
    TronTool

    Tuning of Java neuroph.org neural nets.

    Using a neural net from neuroph.org with the Java programming language, you can modify the net with this software. Easily watch it on screen and move the values up or down to raise or lower the values at the output neurons. Save to and load from different .nnet files. Add comments. Change the visible output neuron. Set the desired input neurons to be visible. Comes in a small, fast, and easy weight version and a continuing version, which should get involved in the neuroph project.
    Downloads: 0 This Week
  • 25
    An application under development that automatically detects vehicles arriving at the entrance and exit using LDR sensors, reads the vehicle number using OCR implemented in Java, and produces a bill.
    Downloads: 0 This Week