Showing 18 open source projects for "video-subtitle-extract"

View related business solutions
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    LLPlayer

    LLPlayer

    The media player for language learning, with dual subtitles

    LLPlayer is an open-source media player designed specifically for language learning through video content. Unlike traditional media players, the application focuses on advanced subtitle-related features that help learners understand and interact with foreign language media more effectively. The player supports dual subtitles so users can simultaneously view text in both the original language and their native language while watching videos.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 2
    SwarmUI

    SwarmUI

    Modular AI image and video generation web UI with extensible tools

    SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface. SwarmUI is built to accommodate both beginners and advanced users by offering a simple “Generate” interface alongside more advanced workflow tools that expose deeper configuration options. It integrates with underlying systems like node-based workflows, enabling flexible and customizable pipelines for complex generation tasks. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3
    MiniMax Skills

    MiniMax Skills

    Development skills for AI coding agents

    MiniMax skills is a modular system designed to provide structured development capabilities for AI coding agents, enabling them to perform complex engineering tasks with guided workflows and domain-specific knowledge. It defines a set of reusable “skills” that encapsulate best practices, architectural patterns, and step-by-step processes for building applications across multiple platforms. These skills can be integrated into AI tools to improve the quality and consistency of generated code,...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    .NET for Apache Spark

    .NET for Apache Spark

    A free, open-source, and cross-platform big data analytics framework

    .NET for Apache Spark provides high-performance APIs for using Apache Spark from C# and F#. With these .NET APIs, you can access the most popular Dataframe and SparkSQL aspects of Apache Spark, for working with structured data, and Spark Structured Streaming, for working with streaming data. .NET for Apache Spark is compliant with .NET Standard - a formal specification of .NET APIs that are common across .NET implementations. This means you can use .NET for Apache Spark anywhere you write...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Generative AI for Beginners .NET

    Generative AI for Beginners .NET

    Hands-on .NET course for building real-world generative AI apps

    Generative AI for Beginners .NET is a hands-on course that helps developers build real-world AI applications using the .NET ecosystem. It walks through core concepts such as text generation, chat-based interactions, and integrating large language models into applications. Each lesson includes short videos, working code samples, and step-by-step instructions, making it easy to follow and apply immediately. Generative AI for Beginners .NET supports tools like GitHub Models, Azure OpenAI...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Glint Translator
    ...Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced settings with 50+ parameters for fine-tuned control • Share and import custom profiles (.glint) between users • Low CPU/RAM usage, optimized for Windows 10/11 Live Subtitle (Real-Time Voice Translation) Real-time speech-to-text translation for games, movies, and voice chats. Automatically detects audio, converts speech to text, and translates it instantly. Example: They speak German → you see Turkish AI Model Support • Google Gemini: 2.5 Flash, 2.5 Pro • OpenAI: GPT-4o, GPT-4 Turbo
    Downloads: 21 This Week
    Last Update:
    See Project
  • 7

    Image To Text tools

    ITTT is a Free tool designed to Scan and extract Text from Images.

    Image To Text Tools is a 100% Free user-friendly tool designed to Scan and extract containing text in images into editable text formats. Whether you need to extract text from scanned documents, photographs, or other image files, Image To Text Tools provides accurate and reliable Optical Character Recognition (OCR) capabilities to meet your needs.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 8
    TikTok Filter Remover

    TikTok Filter Remover

    Removes some filters from TikTok videos

    Introducing our groundbreaking app designed to redefine your TikTok video editing experience! Tired of being limited by filters in downloaded TikTok videos? Our innovative program empowers you to effortlessly remove select filters, giving you unparalleled control over your content. Utilizing an intuitive interface, our app harnesses sophisticated algorithms and the robust capabilities of the Midjourney API to diligently reconstruct frames, striving for optimal results.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Microsoft AI Lab

    Microsoft AI Lab

    Explore, learn, and code the latest breakthrough AI innovations

    Explore our most popular projects and experiments, demonstrating the possibilities of AI. Jumpstart your own AI innovations with learning resources and development solutions from Microsoft AI. Explore insights and behind-the-scenes technology for breakthrough AI innovations. From Tech Minutes videos to Technology Deep Dives, learn about the engineering that powers the future of AI. AI at Scale is expanding the possibilities of AI innovation by pushing the boundaries of infrastructure,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    JATI - Just Another Tesseract Interface

    JATI - Just Another Tesseract Interface

    Another interface for tesseract OCR to convert image to text.

    Tesseract OCR is an open source, highly accurate image to text converter. Nevertheless, Tesseract OCR provides only command line interface. JATI is just another interface to the Tesseract OCR engine, providing GUI interface to convert an image to text. It can do batch conversion, including converting only portion of the image into text.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    JAVT - Just Another Voice Transformer

    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    reacTIVision
    reacTIVision is a computer vision framework for the fast and robust tracking of markers attached on physical objects, and the creation of multi-touch surfaces. It was designed for the rapid development of table-based tangible user interfaces.
    Downloads: 121 This Week
    Last Update:
    See Project
  • 13
    Langton's Ant by Lumpiluk
    This is my implementation of Langton's Ant (a popular turing machine, see http://en.wikipedia.org/wiki/Langton%27s_ant for more information) using C#. Supports multiple ants and threading. Video: http://www.youtube.com/watch?v=Vu9C_1DH31k
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SubHub

    SubHub

    Post-OCR correction tool for SRT subtitles

    A post-OCR correction tool for SRT subtitle files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    ACAFE-clinical decision support EMR

    A computerised clinical decision support advancement for emergency med

    ACAFE-a computerised clinical decision support advancement for emergency medicine. Hospital electronic medical record EHR or EMR. Used commercially. Other uses include research-development into chronic and infectious-diseases. Watch video here of demo https://www.youtube.com/watch?v=SGlicILHhJs
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    RoboRealm Extensions are plugins to the RoboRealm application that allow you to extend RoboRealm in whatever way you need. RoboRealm is a powerful computer vision based application for use in machine vision, image analysis, and image processing systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    A .NET wrapper for OpenCV, a computer vision library. The wrapper is fully developed in C#, making the OpenCV library available to all the .NET based languages, like C#, VB.NET, managed C++, IronPython, F#, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The Speech Recognition Media Player is designed to browse and play your music and videos only with your voice. Plug in a remote microphone to your PC and use it as a Remote Control. Really helpful for the visually handicapped.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB