I'd like to know if MARF is able to transcript speech to text.
> I'd like to know if MARF is able to transcript speech to text.
Not directly at this point, sorry. You can try CMU Sphinx.
We are working on speech-to-text plug-ins, but not quite
there with the actual man power contributing... :( I am
considering making Sphinx's stuff to make as plugins for
MARF for this purpose as the task by itself is pretty
complex and time consuming to do right, especially if
you want a general-purpose speech-to-text engine. Specific-purpose
is easy, but limiting.
I am also looking for such a program. Really, what we need is something that will convert simple sounds to phonemes. Not interested in language processing.
Ex. If someone said "papa" (onto an attached microphone) the program should generate the text 'papa'.
We are working on the front end of an language independent phoeneme processing system.
Please respond to
Date: Thu, 7 Feb 2008 12:32:34 -0500 (EST)
From: Serguei A. Mokhov <mokhov AT cs concordia ca>
To: siyengar AT winona edu
Cc: Serguei A. Mokhov <mokhov AT cs concordia ca>
Subject: Re: [marf-dev] [marf - Help] RE: Speech to text
On Wed, 6 Feb 2008, SourceForge.net wrote:
> Date: Wed, 06 Feb 2008 11:10:54 -0800
> From: SourceForge.net <firstname.lastname@example.org>
> To: email@example.com
> Subject: [marf-dev] [marf - Help] RE: Speech to text
> I am also looking for such a program. Really, what we need is something
> that will convert simple sounds to phonemes. Not interested in language
> Ex. If someone said "papa" (onto an attached microphone) the program
> should generate the text 'papa'.
This task (of 'papa') is not difficult and can be done without phoneme
processing. Unless I misunderstood you, you only need Segmentation on
word-boundary (by detecting silence gaps) and matching up the segmented
word utterances to a training set dictionary (preprocessed and with
features extracted). Such word-boundary segmentation for MARF is being
worked on as I am writing this.
> We are working on the front end of an language independent phoeneme
> processing system.
If you have to to phonemes, which are components of each word's
utterances, the simple word-boundary segmentation is not enough, and
especially in a continuous flow of speech data followed by a state-machine
based on hidden Markov models (HMMs) and the like. Then you get a good
semblence of a full speech-to-text syste. MARF due to lack of man power to
work on that doesn't do that. (You are welcome to contribute it though if
you are up to it and want to help out ;).
As I mentioned before, CMU Sphinx (also hosted on SourceForge.net and is
also in Java and also licensed in BSD) does that (HMMs and others), so you
can look at that engine.
As far as MARF concerned, as earlier discussion in this and similar
threads indicate, we may simply adapt Sphinx' code as a set of plug-ins
for MARF to do he speech-to-text tasks.
Hope that is of any help to you.
> Please respond to
Doing that ;) With a copy to the forum...
Serguei A. Mokhov | /~\ The ASCII
Computer Science Department | \ / Ribbon Campaign
Concordia University | X Against HTML
Montreal, Quebec, Canada | / \ Email!
I responded to your msg at firstname.lastname@example.org - let me know if you received my response.
I was wondering if you were able to find a solution to the generic speech-to-text transcription problem. I'd appreciate some pointers etc.
Dr. Iyengar and I exchanged a couple of private emails,
and then he fell silent. Either very busy or found
an alternative, e.g. the earlier mentioned CMU Sphinx
(also hosted on SourceForge). Take a look at Sphinx
it may do most of what you seek.
Log in to post a comment.
Sign up for the SourceForge newsletter:
You seem to have CSS turned off.
Please don't fill out this field.