This classification problem involves classifying 20000 messages into 20 different classes. The dataset can be found here: https://archive.ics.uci.edu/ml/datasets/Twenty+Newsgroups. Four Machine Learning algorithms; Naïve Bayes, Logistic Regression, Regularized Logistic Regression Support Vector Machine (SVM) were implemented and there training and test dataset accuracy were compared. Arguably, one of the most important aspect to solving this problem is having the appropriate data set format. Each of these algorithms has its peculiar data format; the specific format and how to reconstruct the entire dataset are illustrated in other sections below. Out of all the methods, SVM using the Libsvm [1] produced the most accurate and optimized result for its classification accuracy for the 20 classes. All the algorithm implementation was written Matlab.
Download the code and Report here.

Project Activity

See All Activity >

Follow classify-20-NG-with-4-ML-Algo

classify-20-NG-with-4-ML-Algo Web Site

Other Useful Business Software
Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place. Icon
Incredable is the first DLT-secured platform that allows you to save time, eliminate errors, and ensure your organization is compliant all in one place.

For healthcare Providers and Facilities

Incredable streamlines and simplifies the complex process of medical credentialing for hospitals and medical facilities, helping you save valuable time, reduce costs, and minimize risks. With Incredable, you can effortlessly manage all your healthcare providers and their credentials within a single, unified platform. Our state-of-the-art technology ensures top-notch data security, giving you peace of mind.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of classify-20-NG-with-4-ML-Algo!

Additional Project Details

Registered

2016-01-21