The lack of a multi-purpose text corpus, despite the surge in available text data, is a serious bottleneck in text mining and natural language processing, especially for the Persian language. This project presents Persica, a new Persian corpus for news article analysis. News analysis includes news classification, topic discovery and classification, category classification, and many other tasks. Working with news has special requirements, the first of which is a valid and reliable corpus on which to run experiments.
Please use this reference to cite us:
@inproceedings{eghbalzadeh2012persica,
title={Persica: A Persian corpus for multi-purpose text mining and Natural language processing},
author={Eghbalzadeh, Hamid and Hosseini, Behrooz and Khadivi, Shahram and Khodabakhsh, Ali},
booktitle={Telecommunications (IST), 2012 Sixth International Symposium on},
pages={1207--1214},
year={2012},
organization={IEEE}
}

License

Creative Commons Attribution ShareAlike License V2.0

Additional Project Details

Database Environment

Microsoft SQL Server

Registered

2012-06-12