天天看点

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

diabetes prediction dataset

https://archive.ics.uci.edu/ml/datasets/Early+stage+diabetes+risk+prediction+dataset.

在weka中打开

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

How to use Weka to run a classifier(a classification model)

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

Choose classifier

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
这个就是C4.5决策树算法的实现(weka成为J48)
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

这里 -C 0.25 是Confidence Factor=0.25

-M 2 是minNumObj=2,即 the minimum number of instances per leaf

可以在这change options

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

Classifier evalution

for several classifier evaluation method, see 

可以看到这里有几个选项可以选择

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

k-fold Cross-validation in Weka

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

meta-classifier

Weka provides a set of meta-classifiers that combine tools with existing classifiers

CVParameterSelection

采用交叉验证的方法,对参数进行优化选择
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

如果要使用J48 algorithm using CVParameterSelection

就要先选择CVParameterSelection,然后在CVParameterSelection的参数选择的classifier中选择J48 algorithm

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
执行结果中可以看到classifier选择的C,也就是最有的C值
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier
即C的值0.2是最优的
Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier

Weka Knowledge Flow

Weka(二)—Classification(糖尿病数据集&Cross-validation交叉验证&meta-classifier(CVParameter)&Weka Knowledge Flow)How to use Weka to run a classifier(a classification model)meta-classifier