File | Date | Author | Commit |
---|---|---|---|
README.md | 2023-07-01 |
![]() |
[931213] Update README.md |
info.json | 2023-06-30 |
![]() |
[e5080f] Update info.json |
SoundTranscriber can be used to generate automatic transcription / automatic subtitles for audio/video files through a friendly graphical user interface.
Developed by Mahmoud Atef, Ahmed Bakr, and Qais Alrefai from the TecWindow team.
download Sound Transcriber version 1.0.0.
This user guide aims to provide you with a comprehensive understanding of Sound Transcriber and help you make the most of its features.
We highly recommend reading this guide to ensure optimal usage of the program.
Sound Transcriber is an accessible audio-to-text conversion program designed to transcribe audio and video files, it offers support for extracting subtitle files and more.
Developed by Mahmoud Atef, Ahmed Bakr, and Qais Alrefai from the TecWindow team.
Sound Transcriber offers the following features:
We have several planned features in the pipeline, including:
The software currently only supports online conversion using Google's speech recognition, OpenAi's Whisper, and Meta's wit.ai.
Please take note of the following important information:
Sound Transcriber supports the following file extensions for conversion:
.mp3, .wav, .aac, .flac, .oga, .opus, .mp4, .avi, .mkv, .mov.
If we were to include an API key within the program itself, it would likely be blocked after widespread usage by multiple users.
Moreover, wit.ai provides distinct API keys for each language. This means that you need to create an application in the desired language and obtain its corresponding API key.
Unfortunately, it is not feasible for us to gather API keys for all languages since they vary based on individual preferences.
Therefore, we will provide you with instructions on how to obtain your own private API key.
Although the following steps may appear extensive, they are straightforward and only need to be completed once.
You can repeat these steps and create a new application with a different name to obtain an API key for transcribing in another language. If you want to use multiple languages with wit.ai, simply repeat the steps to obtain an API key for each language.
Sound Transcriber supports transcription through the use of OpenAI's Whisper API keys, which are not available for free.
The pricing is based on the number of characters transcribed, and specific plans are not mentioned here.
To get detailed information about the limits and subscription options, please visit this page/. Keep in mind that signing in and adding a payment method is at your own risk.
To obtain an API key for Sound Transcriber, go to the API key page and click on "Create new secret key." Copy the generated key and proceed to add it in the program settings as demonstrated later.
Upon opening the program, you will find an edit box displaying the transcribed result. Use the tab key to navigate through the other options.
The "Language" box allows you to specify the language of the file you want to transcribe. Select the appropriate language using the arrow keys.
Click the "Start" button to initiate the conversion process.
Next, you will find the "Save As" button, which allows you to specify the output saving preferences.
Below that, there is a read-only edit box indicating the path of the file to be transcribed.
Use the "Browse" button to locate and select the file you want to transcribe.
Additionally, you can utilize keyboard shortcuts, which will be explained later.
Please note that the order of items on the screen may differ when navigating with the Tab key.
The program includes several menus accessible by pressing the Alt key.
It contains the names of the services available for conversion, you can selecte any service.
Similar to the NVDA screen reader settings, the Sound Transcriber settings are categorized into several sections, each containing various options. You can navigate between sections using the up and down arrows. Use the Tab and Shift+Tab keys to scroll through the options within the selected section.
This section includes various program-wide options:
The options in this section affect the saving functionality in the program's File menu.
If the autosave feature is disabled, the "Save" option in the "File" menu will perform the same function, saving files according to the specified extensions and path.
This section requires you to enter a secure API key in the provided text box named "Secret Key." You can click the edit button to modify the key. Additionally, you can adjust the segment duration by specifying the duration of each part of the file when using this service.
Note that the file needs to be divided into several segments for conversion. The maximum duration per segment for this service is one minute.
Similar to the previous service, this section allows you to enter an API key. However, the maximum length for each file when using OpenAI is 30 seconds.
As Wit.ai separates languages based on API keys, this section allows you to combine languages as follows:
You will find a list of currently added languages.
Each language has a corresponding hidden edit field for the API key.
Use the modify button to change the key or the add button to add a new key.
Select the language matching your application in Wit.ai, paste the key, and click Add.
Repeat these steps for each language you intend to use. After obtaining a key from the Wit.ai site, return to the settings window to add it.
You can delete individual keys or all saved keys associated with this service using the provided buttons.
Lastly, you can specify the duration of each file segment, ranging from 4 to 20 seconds. Choose the duration that yields the best results.
Press OK when you have finished adjusting the settings.
Sound Transcriber provides several keyboard shortcuts to enhance speed and ease of use.
To convert files, open Sound Transcriber and either browse for the file by clicking "Browse" or use the shortcut Ctrl+O. Alternatively, you can copy the file from your device and paste it using Ctrl+V.
Choose the desired language and service using the provided shortcuts or adjust them in the settings. Press "Start" or use the shortcut Ctrl+Enter to initiate the conversion.
If you encounter any bug with Sound Transcriber, you can use the communication methods available in the "Contact Us" menu under the "Help" section. Provide a detailed explanation of the actions that led to the bug. We recommend sharing the Sound Transcriber.log file, which will assist us in understanding and resolving the bug more effectively.
You can find the file in the following path:
AppData\Roaming\tecwindow\SoundTranscriber
Many thanks to Riad Assoum for translating the user guide into English and proofreading the program's English and Arabic strings.