AFEWC and eNews corpora are multilingual comparable text articles in Arabic/French/English. Each triple article is related to the same topic. AFEWC corpus is collected from Wikipedia and eNews is collected from euronews.com. Corpora are licensed under Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License, and available for research purposes only.
Wikipedia text is available under Creative Commons Attribution-ShareAlike 3.0 License.
According to www.euronews.com/terms-and-conditions/ , the entire of euronews’ text is copyrighted under the French copyright laws. You may not modify, publish, translate, transmit, participate in the transfer or sale, create derivative works, or in any way exploit, any of the content, in whole or in part.
To cite: Saad, M.; Langlois, D. & Smaïli, K. (2013), Extracting Comparable Articles from Wikipedia and Measuring their Comparabilities, Procedia - Social and Behavioral Sciences 95 (0), 40-47
crlcl works great