I am trying to decompress the *.wv1 files in the WSJ corpora. The shorten package included with the dataset is very old (it still references NeXT and similar systems), and I'm unable to compile it. I also tried the latest version of SPHERE from NIST's website (which includes shorten), but I can only get the files in /src/lib to compile, not those in /src/bin. Can anyone suggest what changes are needed to get either of these to work? Alternatively, could someone post an executable or modified source that works on Ubuntu or Cygwin?
cheers!
Sunny
You need to download sph2pipe from NIST; it has precompiled versions for both Windows and Linux.
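Once sph2pipe is installed, something like the following Python sketch could batch-convert a directory of *.wv1 files to WAV. It assumes the executable is on your PATH under the name sph2pipe and that your build accepts the "-f wav" output option (check the usage text it prints for your version). The helper names and the choice to write each .wav next to its source file are only illustrative, not part of sph2pipe itself.

```python
import shutil
import subprocess
from pathlib import Path


def build_sph2pipe_command(src, dst, exe="sph2pipe"):
    """Return the argument list for one conversion (hypothetical helper).

    Assumes the usual form: sph2pipe -f wav <infile> <outfile>
    """
    return [exe, "-f", "wav", str(src), str(dst)]


def convert_tree(root, exe="sph2pipe"):
    """Convert every *.wv1 file under root, writing a .wav beside each one."""
    if shutil.which(exe) is None:
        raise FileNotFoundError(f"{exe} was not found on PATH")
    converted = []
    # Note: on a case-sensitive filesystem the corpus files may be named *.WV1.
    for src in sorted(Path(root).rglob("*.wv1")):
        dst = src.with_suffix(".wav")
        # check=True raises CalledProcessError if sph2pipe exits with an error
        subprocess.run(build_sph2pipe_command(src, dst, exe), check=True)
        converted.append(dst)
    return converted
```

For example, convert_tree("wsj0") would walk a (hypothetical) wsj0 directory and return the list of .wav paths it wrote. sph2pipe handles the shorten decompression internally, so the old shorten sources do not need to be compiled at all.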