Search Results for "transcription linux" - Page 2

Showing 46 open source projects for "transcription linux"

  • 1
    FireRedASR

    Open-source industrial-grade ASR models

    FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition...
    Downloads: 0 This Week
  • 2
    txtai

    Build AI-powered semantic search applications

    txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...
    Downloads: 2 This Week
  • 3
    footswitch2

    Audio transcription software for Linux (VLC) with a foot pedal

    ...There are multiple timestamp macros, along with media player control macros for: Play/Pause toggle, Jump backwards, Jump forwards, Slower, Faster, Normal audio speed, and Jump to timestamp. It allows for multiple transcription clients and database export for spreadsheet invoicing. This combination of fs2 and LibreOffice, unlike some other transcription tools, means you are working in a world-class editor, with the ability to save documents in multiple formats, including Microsoft Word formats.
    Downloads: 21 This Week
  • 4
    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    SPPAS is a scientific software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package this simple for linguists to use for the automatic annotation of speech, the analysis of any kind of annotated data, and the conversion of annotated files. SPPAS can automatically produce speech annotations from a recorded speech sound and its orthographic...
    Downloads: 14 This Week
  • 5
    footswitch3

    Audio transcription software for Linux (GStreamer) with a foot pedal

    ...There are 3 timestamp macros, along with media player control macros for: Play/Pause toggle, Jump backwards, Jump forwards, Slower, Faster, Normal audio speed, and Jump to timestamp. It allows for multiple transcription clients and database export for spreadsheet invoicing. This combination of fs3 and LibreOffice, unlike some other transcription tools, means you are working in a world-class editor, with the ability to save documents in multiple formats, including Microsoft Word formats.
    Downloads: 6 This Week
  • 6
    footswitch2basic

    Audio transcription software for Linux (VLC) with a foot pedal

    Footswitch 2 (Basic) is a media player for transcribers on Linux. It is a stripped-down version of Footswitch2, containing only the absolute essentials for transcription. Written in Python and using the Python bindings for VLC, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal. ...
    Downloads: 3 This Week
  • 7
    footswitch3basic

    Audio transcription software for Linux (GStreamer) with a foot pedal

    Footswitch3basic is a media player for transcribers on Linux. Written in Python using the bindings for GStreamer, it allows a transcriber to control the audio or video with a foot pedal, and includes a set of macros that integrate into LibreOffice. This allows the transcriber to control the media player from within LibreOffice as well, making it useful for those who do not yet own a foot pedal/foot switch.
    Downloads: 0 This Week
  • 8
    Footswitch2 Equaliser

    15-band PulseAudio equaliser

    A 15-band audio equaliser originally intended for use with the Footswitch2 transcription tools, but it will happily run independently. This Linux Python utility provides a GUI front end for modifying the sound of audio using PulseAudio's LADSPA module with Steve Harris's mbeq_1197 plugin (a multiband equaliser) and Frank Neumann's split_1406 plugin (a mono-to-stereo splitter). PulseAudio must be up and running for this software to work.
    Downloads: 0 This Week
  • 9
    Piano transcription

    Task of transcribing piano recordings into MIDI files

    Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...
    Downloads: 2 This Week
  • 10
    GiantMIDI-Piano

    Classical piano MIDI dataset

    GiantMIDI-Piano is a large-scale symbolic classical piano music dataset built by applying the piano_transcription system on a vast collection of piano performance recordings. The dataset contains thousands of piano works, spanning a large number of composers and styles, with each piece transcribed into high-precision MIDI files capturing note events, pedal usage, velocities, etc. It provides a resource for music information retrieval (MIR), symbolic music modeling, composer classification,...
    Downloads: 1 This Week
  • 11

    MutaNET

    NGS Pipeline and Automated Mutation Analysis

    MutaNET comes with a next generation sequencing (NGS) pipeline that calls mutations based on paired-end NGS reads, an automated mutation analysis tool and various file converters and mergers. The mutation analysis feature considers the coding region, protein domains, regulation and transcription factor binding site information, and can be used to analyse the potential impact of mutations on antibiotic resistance.
    Downloads: 0 This Week
  • 12

    epidaurus

    Tumor epigenetic database

    Epidaurus is a collection of epigenetic datasets including transcription factor ChIP-seq, histone ChIP-seq, DNase-seq, FAIRE-seq, DNA methylation, etc. It also includes commonly used genome features (GC content, conservation) and RNA-seq. A user-friendly interface has been developed to interrogate and visualize these datasets.
    Downloads: 0 This Week
  • 13
    DACO-algorithm

    A novel transcription factor complex prediction algorithm.

    Eukaryotic gene expression is controlled through molecular logic circuits that combine regulatory signals of many different factors. Complexation of transcription factors and other regulatory proteins is a prevailing and highly conserved mechanism of signal integration within critical regulatory pathways, and it makes it possible to infer the controlled genes as well as the exerted regulatory mechanism. We developed DACO (domain-aware cohesiveness optimization), a novel algorithm that combines...
    Downloads: 0 This Week
  • 14
    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases...
    Downloads: 0 This Week
  • 15
    openBMD

    A cross-platform transcription application.

    openBMD is a multi-platform desktop application for the transcription of Birth, Marriage and Death (BMD) index records as part of the FreeBMD project to transcribe the Civil Registration index for England and Wales.
    Downloads: 0 This Week
  • 16
    Traditional Siamese music transcription takes the form of a sequence of either Thai letters or numbers. This program reads the music transcription from text files and plays the transcribed music, which could help composers write new songs more easily.
    Downloads: 1 This Week
  • 17
    Kana no quiz is a small educational tool for memorizing the transcription and pronunciation of Japanese kana (katakana & hiragana), presented as a quiz. It is written in Python and uses a GTK+ interface for nice cross-platform rendering!
    Downloads: 1 This Week
  • 18
    Python scripts for converting Chinese Pinyin transcription (ISO 7098) to the International Phonetic Alphabet (IPA), comprising a core module for developers and a flexible GUI application for end users working on Modern Chinese phonetics.
    Downloads: 0 This Week
  • 19
    Open-source content and evaluation framework for music transcription systems. It can be used as a monophonic or polyphonic database through software mixing.
    Downloads: 0 This Week
  • 20
    Keystroke enables the transcription and logging of continuous media such as audio or video. It features media control (pause/play/seek) through keys to increase efficiency while transcribing and/or logging.
    Downloads: 1 This Week
  • 21
    Cellogica is a cellular logic analysis tool. It incorporates sequential logic and finite state machines to elucidate the relationship between transcription factors and the corresponding gene expression.
    Downloads: 0 This Week