CS 3120 Classification using different classification tools solved

$35.00

Category: You will receive a download link of the .ZIP file upon Payment

Description

5/5 - (1 vote)

Requirement:
Select a public dataset with multiple classes. Use at least three different classifiers to classify them. Compare and discuss the classification performance of different models that you selected. Discuss what could affect the performance for different classifiers (e.g. values of hyperparameters that you used) and how they affect the performance.
1. Select adataset: select one from iris.csv or MNSt.csv or
any public dataset (include your csv file in your submission if you use another dataset)

2. Split the dataset:
Please randomly partition the data into training and testing datasets. For example, 70% for training, and 30% for testing.
If you want to better select the values of some hyperparameters, you will need to do validation (not required, but suggested). You will need to partition the data into these three splits. For example, 70% for training, 10% for validation and 20% for testing. Report your final performance only using the testing dataset.(2pts)

3. Select at least three differenttypes of classifiersbased on different techniques (linear SVM and nonlinear SVM belong to one type) and fit the training set to your models. You might need to convert the string label values to numbers using “LabelEncoder in sklearn”.(2pts)

4. Train your Classifiers and perform validation if needed using different hyperparameters values

5. Evaluate: Once your classifier is trained, you should evaluate the classification performance (accuracy, precision, recall, F-measure) on the test set. These scores need to be included in your report.(5pts)

6. Repeat the training and testing for different classifiers you selected. Write down all classifiers that you selected and write down model resourcefor all of them in your report. If some classifier was defined by your own function, please include that function definition (code) in your report.(5pts)
e.g.
import sklearn.svm as svm #model resource(see below)
from sklearn.tree import DecisionTreeClassifier #model resource(see below)
from sklearn.linear_model import LogisticRegression #model resource(see below)
model1 svm.SVC(kernel=’poly’, C=C,gamma=Gamma)
model2LogisticRegression()
model3 DecisionTreeClassifier()

7. Which is the best classifier and why?Answer or analysis of this question needs to be included in your report. (3pts)

8. What affect the classification performance? Answer or analysis of this question needs to be included in your report. (3pts)
You could try different sizes of the training/testing data, different hyperparameters’ values, Or even different datasets using different classifiers.

9. Bonus up to 5 pts.(Anything that was not listed above.)

Submission:
File 1: Report
File 2: Code (+dataset if you used another dataset that not in blackboard)

Write a report to describe /answer required questions of your design.
Upload your code with comments as a separate .py or zip file.
e.g.
File1: MidtermHW_FirstnameLastname.doc/.pdf (this is the report)
+
File2: MidtermHW_ FirstnameLastname.py (this is the code. only “.py” files accepted.
OR
MidtermHW_ FirstnameLastname.zip if you have multiple “.py” files or another dataset.