The EASC is an Arabic natural language resource. It contains 153 Arabic articles and 765 human-generated extractive summaries of those articles. The summaries were generated using Mechanical Turk (http://www.mturk.com/).
Among the major features of EASC are:
File names and extensions are formatted to be compatible with current evaluation systems such as ROUGE and AutoSummENG.
The corpus is available in two encodings: UTF-8 and ISO-8859-6 (Arabic).
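Because the corpus ships in two encodings, a tool that expects only one may need the text converted first. The following is a minimal Python sketch of that conversion, not part of the EASC distribution: the function name `transcode` and the sample string are hypothetical, and the sample text is not taken from the corpus.

```python
# Minimal sketch: convert Arabic text between ISO-8859-6 and UTF-8.
# `transcode` and `sample` are illustrative names, not part of EASC.

def transcode(data: bytes, src: str = "iso-8859-6", dst: str = "utf-8") -> bytes:
    """Decode bytes from the source encoding and re-encode them in the target."""
    return data.decode(src).encode(dst)

# Hypothetical sample word ("summary" in Arabic), not drawn from the corpus.
sample = "ملخص"

iso_bytes = sample.encode("iso-8859-6")   # one byte per Arabic letter
utf8_bytes = transcode(iso_bytes)         # two bytes per Arabic letter

# The text survives the round trip unchanged.
round_trip = utf8_bytes.decode("utf-8")
```

For files, the same idea applies: open the file with `encoding="iso-8859-6"` when reading and `encoding="utf-8"` when writing. Note that ISO-8859-6 covers only the basic Arabic letters, so converting in the other direction can fail on characters outside that set.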
The Essex Arabic Summaries Corpus (EASC) uses copyright material. Users of the corpus are responsible for ensuring that they comply with the terms of the copyrights that apply to the source material and the derived works (summaries) and the terms of relevant copyright law.
Any other original data that is distributed with this corpus is made available under the Creative Commons Attribution-ShareAlike license (http://creativecommons.org/licenses/by-sa/3.0/). You must provide details of the source of the material when using it.