Showing 30 open source projects for "java video codecs"

  • 1
    Qwen3-Omni

    Qwen3-Omni is a natively end-to-end, omni-modal LLM

    ...It achieves state-of-the-art results: across 36 audio and audio-visual benchmarks, it hits open-source SOTA on 32 and overall SOTA on 22, outperforming or matching strong closed-source models such as Gemini-2.5 Pro and GPT-4o. To reduce latency, especially in audio/video streaming, Talker predicts discrete speech codecs via a multi-codebook scheme and replaces heavier diffusion approaches.
    Downloads: 4 This Week
  • 2
    OpenCV

    Open Source Computer Vision Library

    OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.
    Downloads: 9 This Week
  • 3
    txtai

    Build AI-powered semantic search applications

    txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...
    Downloads: 3 This Week
  • 4
    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has >2500 algorithms, extensive documentation and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, iOS, and in your browser through JavaScript.
    Languages: C++, Python, Julia, JavaScript
    Homepage: https://opencv.org
    Q&A forum: https://forum.opencv.org/
    Documentation: https://docs.opencv.org
    Source code: https://github.com/opencv
    Please pay special attention to our tutorials!...
    Downloads: 3,191 This Week
  • 5
    Google2SRT

    Download, save and convert multiple subtitles from YouTube videos

    Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply enter the video's URL, and Google2SRT will do the rest.
    Downloads: 36 This Week
  • 6
    MyBox

    Easy Tools of PDF, Image, File, Network, Data, and Medias

    Keywords: javafx-desktop-apps, pdf, image, ocr, icc, barcode, color-palette, text, bytes, markdown, html, archive, compress, digest, video, audio, editor, converter, media
    Source code: https://github.com/Mararsh/MyBox
    Self-contained packages need neither a Java environment nor installation. Jar packages need Java 16 or higher.
    Downloads: 3 This Week
  • 7
    File-Studio

    A tool that automates complex file operations.

    File-Studio is a tool that assists in handling complex file operations such as bulk renaming, organizing folders, and more.
    Downloads: 18 This Week
  • 8
    Conversations

    Java app for chatting with a generative AI (involving TTS and STT)

    Java application for chatting with the generative AI Llama3. The user can speak into the microphone (speechToText), edit the recognized text, and send it to the AI. The AI responds, the server returns that response in real time with the sentences converted to audio (textToSpeech), and the application plays them through the speaker.
    Downloads: 1 This Week
  • 9
    AlphaPlayer

    AlphaPlayer is a video animation engine

    AlphaPlayer is positioned as a multimedia or media-player library or application under ByteDance, likely intended to provide video/audio playback functionality, streaming, or media rendering capabilities. It probably serves as a foundation for building media-heavy applications — offering features like playback control, streaming support, adaptive media handling, and possibly integration with custom codecs or streaming protocols. For developers building web, desktop, or mobile applications needing stable audio/video playback and streaming — perhaps for ByteDance’s own media apps — AlphaPlayer could provide a ready-to-use or extendable library to embed media capabilities. ...
    Downloads: 2 This Week
  • 10
    insofts player
    insofts-player is a free media player with which you can easily and conveniently view video and listen to audio files in various formats without installing additional codecs. It plays streaming video and audio and has a constantly updated online media library. Additional features: sound recording, UART protocol support, speech to text.
    Downloads: 0 This Week
  • 11
    Marvin Image Processing Framework
    Marvin is an image processing framework that provides features for image and video frame manipulation, multithreaded image processing, image filtering and analysis, unit testing, performance analysis, and the addition of new features via plug-ins.
    Downloads: 2 This Week
  • 12
    reacTIVision
    reacTIVision is a computer vision framework for the fast and robust tracking of markers attached to physical objects, and the creation of multi-touch surfaces. It was designed for the rapid development of table-based tangible user interfaces.
    Downloads: 72 This Week
  • 13
    Scene
    Scene is a computer vision framework that performs background subtraction and object tracking, using two traditional algorithms and three more recent algorithms based on neural networks and fuzzy classification rules. For each detected object, Scene sends TUIO messages to one or several client applications. The present release features GPU accelerated versions of all the background subtraction methods and morphological post processing of the object blobs with dilation and erosion filters,...
    Downloads: 0 This Week
  • 14
    JAABA

    The Janelia Automated Animal Behavior Annotator

    The Janelia Automatic Animal Behavior Annotator (JAABA) is a machine learning-based system that enables researchers to automatically compute interpretable, quantitative statistics describing video of behaving animals. Through our system, users encode their intuition about the structure of behavior by labeling the behavior of the animal, e.g. walking, grooming, or following, in a small set of video frames. JAABA uses machine learning techniques to convert these manual labels into behavior...
    Downloads: 18 This Week
  • 15
    ACT-CV provides an easy-to-use interface between visual objects on the computer screen (or another video source) and a cognitive modeling tool.
    Downloads: 0 This Week
  • 16
    Visage is a human-computer interface that aims to replace the traditional mouse with the face. Using a webcam and Visage, the movement of the face becomes the movement of the mouse pointer. Left and right eye blinks fire left and right mouse click events.
    Downloads: 0 This Week
  • 17
    Anthromorphic Scribe

    Provides a speech-to-text GUI for Sphinx4

    It provides an interactive speech-to-text application that uses Sphinx 4. With it you can use pre-recorded audio, record your own voice, and convert incompatible audio/video to a format compatible with Sphinx 4. It currently supports U.S. English using the HUB4 acoustic and language models.
    Downloads: 0 This Week
  • 18
    Java Behaviour Trees is a Java framework for easily building and running generic behaviour trees. Here, "behaviour tree" refers to the technique used to control the behaviour of characters in video games. IMPORTANT NOTE: our source code has been moved to GitHub. You will find it at: https://github.com/gaia-ucm/jbt Note however that all the binary files will still be distributed here at SourceForge.
    Downloads: 3 This Week
  • 19
    Black Hole Cortex

    Sphere surface layers of visual cortex approach maximum info density

    Near the surface (event horizon) of a black hole, there is maximum information density in units of squared Planck lengths (and some translation to qubits). Similarly, our imagination is the set of all possible things we can draw onto our most dense layer of visual cortex in electricity patterns. Bigger layers have more neurons to handle those possibilities. A Black Hole Cortex is a kind of visual cortex that has a density of neuron layers similar to the density at various radii from a black hole. What we...
    Downloads: 0 This Week
  • 20
    Tower of Zaldagor

    The classic intelligent role playing game

    Tower of Zaldagor is a role-playing game (RPG) where you need to use your wits and skills to complete challenges. Blade Kahn, a king of Sonthalas, has lost his daughter to the Great White Dragon: Zaldagor. She has been kidnapped by humanoids. You have been picked among the elite to return his daughter safely. Engage in an epic story where Gods change the future and demons interfere with the present. Download and discover the beginning of an epic story. The Android version is now available!...
    Downloads: 0 This Week
  • 21
    Swarm Wars

    Safety in numbers.

    Repository moved to GitHub: https://github.com/happyjack27/SwarmWars
    Video sample: http://youtu.be/s5mLNbdBQGY
    A game where you evolve & compete AI swarms. The organisms use swarm intelligence & ant colony optimization. The organisms can communicate through 3-color signaling as well as by laying beacons. They can attack and repair other organisms. They can select mates, and they can gather and distribute food and material. This behavior is controlled by a genetically evolved...
    Downloads: 0 This Week
  • 22
    RDOTE
    RDOTE is a GNU GPL-licensed framework for transporting data residing in RDB into the Semantic Web. It provides a friendly GUI as well as enough expressivity for creating RDF dumps of RDB data. Video: http://www.youtube.com/watch?v=pk7izhFeuf0
    Downloads: 0 This Week
  • 23
    BayesianCortex

    simple algorithm for a realtime interactive visual cortex for painting

    A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by gradually changing what you painted. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 children because that is all that's needed to do NAND in a fuzzy way as Bayes' Rule, which is NAND at certain extremes. NAND can be...
    Downloads: 0 This Week
  • 24
    Interactive4J
    The project aims to provide simple, easy APIs that let Java developers add interactive abilities to their Java applications, such as speech recognition, handwriting recognition, webcam use, sound recording and playback, decision trees, text to speech, and many others.
    Downloads: 1 This Week
  • 25
    The FlashServer (FS) is part of the MASSim project of the Computational Intelligence Group at TU Clausthal, Germany. Its main task is the visualization of the games as Flash videos. Currently the FS is not in use during the Multi-Agent Contest.
    Downloads: 0 This Week