Here you go: http://www.sandaru1.com/si-LK.tar.gz
I do have this file at http://paragn.fedorapeople.org/si-LK.tar.gz but I need an upstream source download URL. If you can host it somewhere, that would be helpful.
On 23/08/12 11:24, Harshula wrote:
Can the processing steps be automated in a shell script or makefile?
That way Parag can d/l the UCSC word list and build the final output
On Thu, 2012-08-23 at 11:09 +0530, Sandaruwan Gunathilake wrote:
The original word list is still available on the UCSC
page: http://www.ucsc.cmb.ac.lk/ltrl/?page=downloads
I don't have the processed file at the moment - I'll dig up my backups
and check whether I still have them. It's still in the Firefox add-on
though: https://addons.mozilla.org/en-us/firefox/addon/sinhala-spellchecker/
On Thu, Aug 23, 2012 at 10:45 AM, Harshula <email@example.com> wrote:
Parag (CC'd) is wondering where the upstream source tarball
for the word
On Mon, 2010-07-05 at 00:59 +0530, Sandaruwan Gunathilake
> On Sun, Jul 4, 2010 at 11:57 PM, Harshula
> Hi Sandaruwan,
> On Sun, 2010-07-04 at 22:01 +0530, Sandaruwan
> > What about the sinhala words list on UCSC language
> > http://www.ucsc.cmb.ac.lk/ltrl/?page=downloads
> > I switched the word list to that in spellchecker
> The LTRL word list states it has 70142 distinct
> appears to have 26707 words. Did you take a subset of
> words from the LTRL word list?
> No, everything is there. I just compressed the words with the
> "affixcompress" utility and added a few extra rules at the top of the
> file to support "ණ/න/ල/ළ", etc.
> Best Regards,
> Sandaruwan Gunathilake
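
For anyone comparing the two counts quoted above: an affix-compressed dictionary stores stems plus affix flags, so it can have far fewer entries than the flat word list it was built from. The thread does not say how the 26707 figure was obtained, so this is only a rough sketch, assuming it refers to entries in a Hunspell-style .dic file (first line is the entry count, later lines are "stem" or "stem/FLAGS"). The function names and the sample data are made up for illustration and are not the real LTRL data.

```python
import io


def count_distinct_words(lines):
    """Count distinct non-empty entries in a one-word-per-line list."""
    return len({line.strip() for line in lines if line.strip()})


def count_dic_stems(lines):
    """Count distinct stems in a Hunspell-style .dic file.

    Assumption: the first non-empty line is the entry count, and every
    later line is either 'stem' or 'stem/FLAGS'.
    """
    entries = [line.strip() for line in lines if line.strip()]
    # Skip the count line, then drop any '/FLAGS' suffix from each entry.
    stems = {entry.split("/", 1)[0] for entry in entries[1:]}
    return len(stems)


# Tiny made-up samples standing in for the word list and the .dic file.
word_list = io.StringIO("walk\nwalks\nwalked\ntalk\ntalks\nwalk\n")
dic_file = io.StringIO("2\nwalk/AB\ntalk/A\n")

distinct_words = count_distinct_words(word_list)
distinct_stems = count_dic_stems(dic_file)
print(distinct_words, distinct_stems)
```

In the sample, five distinct surface words collapse to two stems, which is the kind of gap described above. Confirming that nothing was lost would still mean expanding the .dic/.aff pair back into a full word list and comparing it with the original.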