The first version of the Basque to English translator has been released.
This translator was produced with funding from the European Association for Machine Translation, based on data from the Basque to Spanish package, and from Matxin en-eu.
Accompanying the data is a small corpus, of "Metamorphosis" by Kafka, taken from the Basque and English editions of Wikisource (cc-by-sa/gfdl and public domain, respectively), aligned using mALIGNa.... read more
This is a new release of apertium-en-ca!
- Wide support for Valentian forms (financed by Universitat Politècnica
de Valéncia, developed by Prompsit and Jim O'Regan)
- Improved vocabulary and transfer rules (financed by Universitat Oberta
de Catalunya, developed by Prompsit and Jim O'Regan)
This is a note to declare the release of the first version of
apertium-is-en (Icelandic → English). This has been the joint work of
Tungutæknisetur (Icelandic Centre for Language Technology) and the
This is the first released pair to include a rule-based lexical
selection module based on Constraint Grammar.
Some statistics below:
Fri Mar 5 12:51:17 GMT 2010 Total: 2069639, Known: 1731691 (83.67%)... read more
A fully compatible Java port of the upcoming lttoolbox 3.2 has been released.
Compared to lttoolbox some experimental code for compounding and flag match (for multiwords and inner inflection) is included.
Contact Jacob Nordfalk for more information on how (and whether) to use this.
Nic Cottrell contributed an initial version of a Java port of lttoolbox.
During GSOC2009 Raphaël and Sergio worked on it, but didnt get processing to work (compilation and expansion worked)
November 2009 Jacob Nordfalk finished it up and optimized it... read more
Apertium is growing to be a rather large project, and is ready for a more formal system of governance.
Voting for members of the governing board is currently taking place on the apertium-stuff mailing list (https://sourceforge.net/mailarchive/forum.php?forum_name=apertium-stuff); a status page is being maintained on our wiki: http://wiki.apertium.org/wiki/Governance
All developers are invited to register their votes.
Great news: Valencian forms according to the language model agreed by all the Valencian universities can be generated from now on as a new feature of the Apertium Spanish-Catalan translator (for more information see http://www.ua.es/spv/assessorament/criteris.pdf\). This is possible thanks to the Servei de Promoció del Valencià and the Transducens research group at the Departament de Llenguatges i Sistemes Informàtics from the Universitat d'Alacant, and Prompsit Language Engineering. In the next months this will be the default translation system to generate raugh translations from Spanish to Catalan (or the other way round) of all the web contents at the Universitat d'Alacant. The university has decided to strongly support the free/open-source technology of Apertium adapting it to their needs. Well done!
We've just released a new language pair: Norwegian Nynorsk–Norwegian Bokmål, apertium-nn-nb. It's the first released automatic translator for Norwegian developed with the free and open-source Apertium machine translator engine. The pair will be available for testing at at http://www.apertium.org/index.php?id=translatetext .
In developing this system, we used the Free language resources Norsk Ordbank (a full form dictionary with morphological annotations, http://www.edd.uio.no/prosjekt/ordbanken/\) and the Oslo-Bergen tagger (a Constraint Grammar disambiguator, http://omilia.uio.no/obt/\). Both of these resources are released under the GPL as Free software. ... read more
Deuet eo er-maez stumm 0.1.0 hon troer brezhoneg-galleg. Disoc'h ur c'henlabour etre Prompsit Language Engineering, Skol-veur Alacant hag Ofis ar Brezhoneg eo an troer-mañ. Well-wazh e c'holo an troer 85% a skrid ha sevel a ra e feur faziañ gerioù e-tro 35%--45%.
We have released version 0.1.0 of our Breton--French translator. The translator is a result of a collaboration between Prompsit Language Engineering, the Universitat d'Alacant and Ofis ar Brezhoneg. The translator has a coverage of around 85% and a word error rate between 35--45%.
The free/open-source machine translation project Apertium (http://www.apertium.org), one of the 150 projects of the 2009 edition of their Google Summer of Code (http://socghop.appspot.com), has finally been granted 9 three-month scholarships.
Nine students have been selected to work this summer for Apertium (see the list and their tasks at
http://socghop.appspot.com/org/home/google/gsoc2009/apertium\), and will get a US$4,500.00 dollar scholarship.... read more
The free/open-source machine translation project Apertium (http://www.apertium.org) has been selected by Google as one of the 151 projects of the 2009 edition of their Google Summer of Code (http://socghop.appspot.com).
Students wishing to work this summer in one of the ideas that project Apertium has published in http://socghop.appspot.com/org/show/google/gsoc2009/apertium, can apply starting the 23 of March to try and get a 4,500 dollar scholarship.
S'ha publicat una nova versió d'apertium-es-ca, el traductor automàtic entre espanyol i català i viceversa. El nou paquet conté millores especialment en el vocabulari, amb uns 13.000 lemes més que la versió anterior. Ha estat finançat parcialment per la Universitat Oberta de Catalunya, el periòdic ADN i la Kutxa i desenvolupat per Prompsit Language Engineering, Eleka Ingeniaritza Linguistikoa i imaxin|software.... read more
Apertium dictionaries are available for Palm devices. You can build your own mobile dictionaries for Palm using "dix2tiny" (a new task in apertium-dixtools). This project, developed by Enrique Benimeli, has been partially funded by the Universitat d'Alacant (Spain).
There's a "nightly build" for each language pair to download from: http://www.apertium.org/files/palm/
And a few stable mobile dictionaries can be found here (SVN):
https://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-mobile/apertium-tinylex-palm/... read more
(in Esperanto below / pli sube en Esperanto)
We've just released a new language pair, for English to Esperanto, apertium-eo-en. It's the 18th released automatic translator developed with the open-source Apertium machine translator engine, the 4th from English and the 3rd to Esperanto. The pair is already available for testing at at http://www.apertium.org/index.php?id=translatetext&lang=eo .
The stated release goals for this version were: ... read more
The first version (0.5.1) of apertium-en-gl, the Apertium-based English-Galician translator, has just been released. It has been developed by the Galician company imaxin|software, one of the companies in the OpenTrad consortium, with collaboration from the universities of Vigo and Santiago de Compostela, using preexisting data developed by groups from many different institutions. Apertium-en-gl has been developed with the support of the Consellaría de Innovación e Industria of the Galician Government (Xunta) and the Ministry of Industry, Tourism and Commerce of the Spanish Government.
The Apertium project has released linguistic data for translating from Basque to Spanish on this free/open-source machine translation platform. The translator is tuned to help Spanish speakers read Basque newspaper articles but does not aim at generating post-editable output. There are a number of identified problems which may be easily addressed to make it more usable. Developers welcome! Try it online at http://www.erdaratu.eu
(Saesneg isod / English below)
Rydem newydd ryddhau par iaith newydd ar gyfer Cymraeg i Saesneg,
apertium-cy-en. Y bwriadau penodol ar gyfer y fersiwn yma oedd:
* I alluogi i ddysgwyr canfod beth yw testun newyddion cyffredinol.
* I alluogi canfod pwy ddywedodd be wrth bwy.
* I alluogi gwahaniaethu a yw eitem benodol yn ddigon diddorol i gael ei
* Dylai brawddegau o tua 5 o eiriau cael ei gyfieithu'n weddol dda
o Gymraeg i Saesneg.... read more
Hem publicat els paquets apertium-oc-ca i apertium-oc-es per a occità(aranès)-català i occità(aranès)-espanyol. Els sistemes reconeixen i generen almenys dues variants de l'occità: una més especifica, l'aranès, que es fa servir a la Vall d'Aran i és llengua cooficial a Catalunya i una altra més general basada en la variant llenguadociana. Les dades d'aquestos dos paquets lingüístics han estat desenvolupades per l'empresa Prompsit Language Engineering i finançades mitjançant concurs públic per la Generalitat de Catalunya. ... read more
The apertium-morph package provides a set of tools and Java classes that allow the Lucene text search engine (http://lucene.apache.org) to use morphological information to index and search. To that end, the linguistic resources developed for Apertium are used to extract morphological information while indexing.
Version 0.9.0 of the language pair data for Portuguese-Galician (apertium-pt-gl) has just been released. It has been developed by imaxin|software, in collaboration with the Universidade de Vigo and the Universidade de Santiago, with partial funding from the Consellaría de Innovación e Industria of the Galician autonomous government (Xunta de Galicia), building on existing data for es-pt and es-gl.
A new version of apertium-transfer-tools has been released (version 1.0.0).
It includes Unicode support thanks to to Jimmy O Regan (http://sourceforge.net/users/jimregan/)
The first version, named 0.8, of package apertium-dixtools has been released. Apertium-dixtools is a Java application that may be used to manipulate Apertium (.dix) dictionaries in many ways: crossing dictionaries for language pair A-B and B-C to get a first set of dictionaries for pair A-C (defining rules to cross these entries), merging two dictionaries, sorting dictionaries, etc. Apertium-dixtools has been programmed by Enrique Benimeli and funded by the Spanish Ministry of Industry, Tourism and Commerce.
The first version of apertium-en-es, the language-pair data for English<-->Spanish machine translation, named 0.6, has been released. This pair has been developed at the Universitat d'Alacant by Mireia Ginestí with help from Enrique Benimeli and other Apertium developers.
Development has been funded by the Spanish Ministry of Industry, Tourism and Commerce.
The new release (0.8.4) of the language-pair package apertium-en-ca (English-Catalan) contains a reworked set of transfer rules and has been extended to contain some frequent words that were missing from the dictionaries.
The first version of ReTraTos (0.8.0) has been released.
ReTraTos is a toolbox to build linguistic resources useful for machine translation (MT): bilingual dictionaries and transfer rules.
The induction systems and open linguistic data can be used with the Apertium toolbox to build open-source MT systems.
More information can be found at:
Three new language pairs have been released in the Apertium project,
* French -- Spanish (bidirectional), version 0.8
* Esperanto -- Spanish (unidirectional: Spanish->Esperanto), version 0.9
* Esperanto -- Catalan (unidirectional: Catalan->Esperanto), version 0.9
The French -- Spanish language pair was funded by Eleka Ingenieritza Linguistikoa S.L. and developed jointly with Prompsit Language Engineering S.L.... read more