Lack of multi-application text corpus despite of the surging text data is a serious bottleneck in the text mining and natural language processing especially in Persian language. This project presents a new corpus for NEWS articles analysis in Persian called Persica. NEWS analysis includes NEWS classification, topic discovery and classification, category classification and many more procedures. Dealing with NEWS has special requirements and first of all a valid and reliable corpus to perform the experiments on them.
Please use this reference to cite us:
@inproceedings{eghbalzadeh2012persica,
title={Persica: A Persian corpus for multi-purpose text mining and Natural language processing},
author={Eghbalzadeh, Hamid and Hosseini, Behrooz and Khadivi, Shahram and Khodabakhsh, Ali},
booktitle={Telecommunications (IST), 2012 Sixth International Symposium on},
pages={1207--1214},
year={2012},
organization={IEEE}
}

Project Samples

Project Activity

See All Activity >

License

Creative Commons Attribution ShareAlike License V2.0

Follow Persica-A new Persian corpus for NLP

Persica-A new Persian corpus for NLP Web Site

You Might Also Like
Red Hat Enterprise Linux on Microsoft Azure Icon
Red Hat Enterprise Linux on Microsoft Azure

Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Persica-A new Persian corpus for NLP!

Additional Project Details

Database Environment

Microsoft SQL Server

Registered

2012-06-12