Showing 21 open source projects for "video to text"

  • 1
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitle (hardsub) from videos

    Extracts hard-coded subtitles (hardsubs) from videos and generates srt files. Text recognition runs locally, so there is no need to apply for a third-party API. The tool is built on a deep learning-based framework that covers subtitle region detection and subtitle content extraction.
    Downloads: 57 This Week
  • 2
    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...
    Downloads: 0 This Week
  • 3
    myplayer Free Karaoke Software

    myplayer Free Karaoke & Media Player Software (Myanmar)

    myplayer2k22 is a video player application for PCs that lets you easily find and sing karaoke songs as well as find and watch movies. It is also compatible with the [myplayer remote] app for Android phones and tablets. If the karaoke PC doesn't have the song you want to sing, you can sing along with a karaoke song file on your phone instead, which is convenient if you keep your favorite songs there.
    Downloads: 16 This Week
  • 4
    xSTUDIO

    xSTUDIO is a high performance playback and review tool.

    xSTUDIO is a high performance playback and review tool designed by and for Visual Effects, Animation and Post Production professionals. The application can load and play large collections of media files. The efficient playback engine allows you to quickly load and play high resolution image formats with a wide range of file formats and encoding. Intuitive tools allow you to create and organise playlists and media sub-sets within playlists to build interactive review sessions, image and video...
    Downloads: 19 This Week
  • 5

    Clipstitch

    Utility to make home movies from your digital camera files

    Full documentation: download clipstitchX.Y.html. Clipstitch makes movies from your camera (or phone) video files. FFmpeg is a professional-quality, free, open-source program for video editing that can perform a huge number of operations and handle every data format. That ability comes at a cost: its commands look complex and are difficult to use and remember. Clipstitch runs as a front-end to ffmpeg so that you use only the subset of ffmpeg commands necessary for making a home movie from your digital camera, puts them in easier-to-read form, and internally combines multiple ffmpeg commands to do certain tasks. ...
    Downloads: 1 This Week
  • 6
    VATSG

    Video automatic transcribe and translated subtitle generator

    ...This is a subtitle generator (VATSG) that uses [moviepy](https://github.com/Zulko/moviepy) to generate an mp3, then [faster-whisper](https://github.com/guillaumekln/faster-whisper) for text recognition, and then the DeepL API to generate a subtitle file (srt format) in your target language. If you are a general user who wants to follow any video or mp3 file in your own language, it provides a way to do so. It is easy to use thanks to its simple, intuitive GUI. You can download either a Windows installer or a version that does not need to be installed. ...
    Downloads: 0 This Week
  • 7
    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate the robustness gaps of your model! ...
    Downloads: 1 This Week
  • 8

    srt2vobsub

    A command-line tool that converts text subtitles to vobsub

    A command-line tool for Linux and other Unix-like systems that generates a pair of .idx/.sub vobsub subtitle files from text subtitles.
    Downloads: 6 This Week
  • 9
    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution (Lios) is free and open source software for converting print into text using either a scanner or a camera. It can also produce text from scanned images from other sources, such as a PDF, an image, a folder containing images, or a screenshot. The program is fully accessible to visually impaired users. A Tesseract Trainer GUI is also shipped with this package. Forum: https://groups.google.com/forum/#!forum/lios Video Tutorial: https://www.youtube.com/playlist?...
    Downloads: 2 This Week
  • 10

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU Encode/Decode/Post Processing; Stream statistics; CCTV; Adaptive hls streams; Load balancing; Temporary urls; HLS push; EPG scanning; Subtitles to text conversions; AD insertion; Logo overlay; Video effects; Relays; Timeshifts; Catchups; Playlists; Restream/Transcode from online streaming services like Youtube, Twitch; Mozaic; Many Outputs; Physical Inputs; Streaming Protocols; File Formats; Presets; Vods/Series server-side support; Pay per view channels; Channels on demand; HTTP Live Streaming (HLS) server-side support; Public API, client server communication via JSON RPC Protocol gzip compression; Deep learning video analysis. Supported deep learning frameworks: Tensorflow, NCSDK, Caffe. ML Hardware:
    Downloads: 0 This Week
  • 11
    PyTorch Natural Language Processing

    Basic Utilities for PyTorch Natural Language Processing (NLP)

    PyTorch-NLP is a library for Natural Language Processing (NLP) in Python. It’s built with the very latest research in mind, and was designed from day one to support rapid prototyping. PyTorch-NLP comes with pre-trained embeddings, samplers, dataset loaders, metrics, neural network modules and text encoders. It’s open-source software, released under the BSD3 license. With your batch in hand, you can use PyTorch to develop and train your model using gradient descent. For example, check out...
    Downloads: 0 This Week
  • 12
    oscp

    multiplatform, small and handy audio/video player with network remote

    oscp by ariel/KotCzarny @ irc://irc.freenode.com/h3droid tiny media player: - libav (mp3, wav, ogg, aac, mpc, wma, flac, ape, avi, mkv, flv, etc.) - wildmidi (mid) - xmp (mod, med, xm, s3m, it, dbm, psm, omx, okt, digi, 669, mtm, acid, umx) - gme (ay, gbs, gym, hes, kss, nsf, nsfe, sap, spc, vgm) - sidplay2 (sid, psid, info) - mdxplay (mdx) - fc14dec (fc, fc13, fc14) - sc68 (sc68, sndh) - asapconv (sap, cmc, cm3, cmr, cms, dmc, dlt, mpt, mpd, rmt, tmc, tm8, tm2) -...
    Downloads: 1 This Week
  • 13
    MysticBBS Shoutcast Panel

    Python Script for Mystic BBS Shoutcast Dashboard

    This script was created so I can display various information to my BBS users about my shoutcast radio stream. This BBS section includes a shoutcast dashboard with station information obtained via RSS/XML. The display includes listener peak, current song playing, next song to play, stream status, stream URL and other pertinent information for the BBS user to see. After the dashboard is displayed, the script continues by showing the last 20 songs played on the station. Useful...
    Downloads: 0 This Week
  • 14
    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT, or Just Another Voice Transformer (formerly Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT lets you convert video files to WAV audio using ffmpeg and then transcribe the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
    Downloads: 1 This Week
  • 15
    PyVcon

    A stylish Video Converter written in Python

    PyVcon is a Python video converter that uses PyQt as its primary GUI toolkit, which gives it a sleek, user-friendly interface. It uses ffmpeg for video conversion, so it performs well and converts any kind of video into mp4, mkv, wmv, avi, 3gp, m4a, mp3 and wma formats. Also included is MediaInfo, which PyVcon partly depends on for video metadata generation.
    Downloads: 2 This Week
  • 16
    UpStage
    WE ARE NO LONGER USING SOURCEFORGE. Please visit http://www.upstage.org.nz for the most up-to-date code (v3 to be released January 2014, beta version available November 2013) and information. UpStage is a web-based venue for cyberformance: artists compile digital media in real time to create live theatrical performance for online audiences.
    Downloads: 0 This Week
  • 17
    EnKoDeur-Mixeur
    EnKoDeur-Mixeur (EKD) is open source software for post-production of videos, pictures and audio. It can also be used to convert videos to many formats. It is written in Python and uses the PyQt4 bindings.
    Downloads: 1 This Week
  • 18
    GOFoto is an application for managing large collections of photos. It allows photo refining and the generation of web galleries and VideoCDs.
    Downloads: 0 This Week
  • 19
    Whyteboard is a painting whiteboard application for Linux and Windows, that allows the annotation of PDF and PostScript documents, and image files with common drawing tools.
    Downloads: 3 This Week
  • 20
    zznotes is a GTK+ based notes tool for multiple media. It can store text, image, audio, video and other binary files in a single database file, and provides a user-friendly interface to access the media.
    Downloads: 0 This Week
  • 21
    Gaupol is a subtitle editor for text based subtitle files (e.g. SubRip and MicroDVD).
    Downloads: 0 This Week
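
Several entries above (VATSG, JAVT, Video-subtitle-extractor) describe the same final step: turning recognised, timestamped text into an srt subtitle file. The sketch below illustrates only that formatting step. It is not code from any of the listed projects: the `Segment` class and both function names are hypothetical, and the segments are hard-coded here rather than produced by a speech recogniser or OCR engine.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical container for one recognised span of speech or text."""
    start: float  # seconds from the start of the media
    end: float    # seconds from the start of the media
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SubRip timestamp HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as SubRip text: index, time range, text, blank line."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        blocks.append(f"{index}\n{time_range}\n{seg.text.strip()}\n")
    # Joining with a newline leaves one blank line between cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Segment(0.0, 2.5, "Hello."), Segment(2.5, 5.0, "Welcome back.")]
    print(segments_to_srt(demo))
```

In a real pipeline the `Segment` values would come from whichever recogniser the tool uses, and a translation step, if any, would rewrite `text` before formatting.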