Tashkeela: Arabic diacritization corpus

Tashkeela: Arabic diacritization corpus (vocalized texts)

Download Tashkeela-arabic-diacritized-text-u…bz2
Tashkeela: Arabic diacritization corpus, a resource of vocalized Arabic texts:
نصوص عربية مشكولة (vocalized Arabic texts)
Contains vocalized Arabic text.
Text format; 75.6 million words.
Please cite this resource as: T. Zerrouki, A. Balla, Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems, Data in Brief (2017),



  • Arabic diacritization corpus
  • Unicode texts
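
Because the texts are plain Unicode, the diacritics are ordinary combining code points. One common way to use a vocalized corpus for auto-diacritization is to strip those marks, producing an undiacritized input paired with the original vocalized target. The sketch below illustrates the idea. It is not part of the Tashkeela distribution: the function names `strip_tashkeel` and `make_pair` are hypothetical, and the choice to cover only U+064B through U+0652 (fathatan through sukun) is an assumption. Other marks, such as the dagger alif U+0670, are left untouched.

```python
import re

# Assumption: treat the eight basic Arabic diacritics as "tashkeel":
# U+064B (fathatan) through U+0652 (sukun), which includes shadda (U+0651).
TASHKEEL = re.compile("[\u064B-\u0652]")


def strip_tashkeel(text: str) -> str:
    """Return the text with the basic Arabic diacritics removed."""
    return TASHKEEL.sub("", text)


def make_pair(vocalized: str) -> tuple[str, str]:
    """Build an (undiacritized input, vocalized target) training pair."""
    return strip_tashkeel(vocalized), vocalized


if __name__ == "__main__":
    # "kataba" written with a fatha on each of its three letters.
    word = "\u0643\u064E\u062A\u064E\u0628\u064E"
    source, target = make_pair(word)
    # The source keeps only the 3 base letters; the target has 6 code points.
    print(len(source), len(target))
```

Applied line by line to the corpus files, this would give parallel input and target text. How the files are laid out inside the archive is not described here, so reading them is left out of the sketch.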


Additional Project Details



Intended Audience

Other Audience


