Training data (data labeling, annotation, workflow) for all data types
...Annotation is required because raw media is considered to be unstructured and not usable without it. That’s why training data is required for many modern machine learning use cases including computer vision, natural language processing and speechrecognition.
Just Another SpeechRecognition and Text to Speech software.
JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a SpeechRecognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.