機器學習之使用 scikit-learn 識别手寫數字

機器學習案例分析一：

識别手寫數字：

scikit-learn 提供了一些标準資料集，例如用于分類的 iris 和 digits 資料集和波士頓房價回歸資料集 .

在下文中，我們從我們的 shell 啟動一個 Python 解釋器IDLE，然後加載 iris 和 digits 資料集。

資料集是一個類似字典的對象，它儲存有關資料的所有資料和一些中繼資料。該資料存儲在 .data 成員中，它是 n_samples,

n_features 數組。在監督問題的情況下，一個或多個響應變量存儲在 .target 成員中。

識别手寫數字源碼：

1、導入所需子產品

'''
手寫數字識别
'''
print(__doc__)

import matplotlib.pyplot as plt
from sklearn import datasets, svm, metrics

處理資料集：

# The digits dataset
digits = datasets.load_digits()

images_and_labels = list(zip(digits.images, digits.target))
for index, (image, label) in enumerate(images_and_labels[:]):
    plt.subplot(, , index + )
    plt.axis('off')
    plt.imshow(image, cmap=plt.cm.gray_r, interpolation='nearest')
    plt.title('Training: %i' % label)


n_samples = len(digits.images)
data = digits.images.reshape((n_samples, -))

資料展示：

機器學習之使用 scikit-learn 識别手寫數字

創捷支援向量機的SVM的分類器：

# Create a classifier: a support vector classifier
classifier = svm.SVC(gamma=)

學習過程：

# We learn the digits on the first half of the digits
classifier.fit(data[:n_samples // 2], digits.target[:n_samples // 2])

預測過程:

# Now predict the value of the digit on the second half:
expected = digits.target[n_samples // :]
predicted = classifier.predict(data[n_samples // :])

print("Classification report for classifier %s:\n%s\n"
      % (classifier, metrics.classification_report(expected, predicted)))
print("Confusion matrix:\n%s" % metrics.confusion_matrix(expected, predicted))

images_and_predictions = list(zip(digits.images[n_samples // :], predicted))
for index, (image, prediction) in enumerate(images_and_predictions[:]):
    plt.subplot(, , index + )
    plt.axis('off')
    plt.imshow(image, cmap=plt.cm.gray_r, interpolation='nearest')
    plt.title('Prediction: %i' % prediction)
plt.show()

機器學習之使用 scikit-learn 識别手寫數字

機器學習案例分析一：

繼續閱讀

卷積後尺寸計算公式

線性回歸之随機梯度下降（sgd）