機器學習實戰篇——用卷積神經網絡算法在Kaggle上跑個分

之前的文章簡單介紹了 Kaggle平台以及如何用支撐向量（SVM）

的機器學習算法識别手寫數字圖檔。可見即使不用神經網絡，傳統的機器學習算法在圖像識别的領域也能取得不錯的成績（我跑出來了97.2% 的正确率）, 但是要将正确率再往上提升就會遇到瓶頸了。

此時，

神經網絡以及深度學習，尤其是卷積神經網路（CNN）

就派上用場了。

用CNN的網絡，在同樣的平台上，目前我将手寫圖檔識别的正确率提高到了99.1%，排名全球900多名左右。

1、導入庫檔案

使用深度學習的方法當然就要用到大名鼎鼎的TensorFlow。

import pandas as pd
import math
import numpy as np
import matplotlib.pyplot as plt, matplotlib.image as mpimg
from sklearn.model_selection import train_test_split
import tensorflow as tf

%matplotlib inline

2、準備資料

與

之前

一樣，需要對資料進行分成Train 和 Test 兩個組。

labeled_images = pd.read_csv('train.csv')
images = labeled_images.iloc[:,1:]
labels = labeled_images.iloc[:,:1]
train_images, test_images,train_labels, test_labels = train_test_split(images, labels, test_size=0.02)

3、建立幫助函數

這是本問最難的部分，作用實際上就是對資料進行處理，轉換成TensorFlow 讀得懂的資料。

One Hot Encode

我們知道，這些圖檔的标簽（識别結果）就是是0到9的10個數字，結果就是一個nx1的矩陣，n是訓練樣本的個數。為了讓模型更加友善地處理資料（計算機是二進制的，最好給它0，1的資料），需要将資料轉換成nx10的矩陣。比如果其中一個樣闆的标記是3，那麼這一行的數列就應該是[0,0,0,1,0,0,0,0,0,0], 如果是9的話[0,0,0,0,0,0,0,0,0,1]。所有的樣本疊起來就是一個nx10的矩陣。

def one_hot_encode(vec, vals=10):
    '''
    For use to one-hot encode the 10- possible labels
    '''
    n = len(vec)
    out = np.zeros((n, vals))
    out[range(n), vec] = 1
    return out

幫助類

從

AI學習筆記——卷積神經網絡（CNN）

的文章中我們知道，一張圖檔有三個次元——長，寬，顔色通道。對于本文中的黑色圖檔，第三個次元為1。在加上樣本的個數(n)，整個訓練樣本應該是一個(nx28x28x1)的四維Tensor(張量)。set_up_images(self)函數就是将圖檔轉換成這樣的Tensor。next_batch（）函數則是n個訓練樣本分成若幹個batch, 一個一個地送給模型(這個叫mini batch)。

class CifarHelper():
    
    def __init__(self):
        self.i = 0
        
        # Intialize some empty variables for later on
        self.training_images = None
        self.training_labels = None
        
        self.test_images = None
        self.test_labels = None
    
    def set_up_images(self):
        
        print("Setting Up Training Images and Labels")
        
        # Vertically stacks the training images
        self.training_images = train_images.as_matrix()
        train_len = self.training_images.shape[0]
        
        # Reshapes and normalizes training images
        self.training_images = self.training_images.reshape(train_len,28,28,1)/255
        # One hot Encodes the training labels (e.g. [0,0,0,1,0,0,0,0,0,0])
        self.training_labels = one_hot_encode(train_labels.as_matrix().reshape(-1), 10)
        
        print("Setting Up Test Images and Labels")
        
        # Vertically stacks the test images
        self.test_images = test_images.as_matrix()
        test_len = self.test_images.shape[0]
        
        # Reshapes and normalizes test images
        self.test_images = self.test_images.reshape(test_len,28,28,1)/255
        # One hot Encodes the test labels (e.g. [0,0,0,1,0,0,0,0,0,0])
        self.test_labels = one_hot_encode(test_labels.as_matrix().reshape(-1), 10)

        
    def next_batch(self, batch_size):
        # Note that the 100 dimension in the reshape call is set by an assumed batch size of 100
        x = self.training_images[self.i:self.i+batch_size]
        y = self.training_labels[self.i:self.i+batch_size]
        self.i = (self.i + batch_size) % len(self.training_images)
        return x, y

最後這兩行代碼就完成了資料的初始化。

# Before Your tf.Session run these two lines
ch = CifarHelper()
ch.set_up_images()

# During your session to grab the next batch use this line
# (Just like we did for mnist.train.next_batch)
# batch = ch.next_batch(100)

4、建立模型

這裡用到了TensorFlow, 也許會之後在單獨的文章中介紹如何使用，這裡簡單介紹一下。

使用TensorFlow 首先是要建立一個 computation graph(計算圖譜)，也就是先告訴計算機模型是怎樣的，包括神經網絡有多少層，每層多少個神經元，輸入輸出資料的格式是怎的。此時還沒有開始計算。

Placeholder

x 輸入，y輸出，hold_prob用于dropout(不多解釋，主要用于随機丢棄神經元的一種正則化的方法)

x = tf.placeholder(tf.float32, shape=[None,28,28,1])
y_true = tf.placeholder(tf.float32, shape=[None,10])
hold_prob = tf.placeholder(tf.float32)

Help functions

這些函數是為了簡化Tensorflow 建立神經網絡的方法，根據從

之前文章

對CNN的介紹，我們需要卷積層，Pooling(池化)層，以及全連接配接層等等。

def init_weights(shape):
    init_random_dist = tf.truncated_normal(shape, stddev=0.1)
    return tf.Variable(init_random_dist)

def init_bias(shape):
    init_bias_vals = tf.constant(0.1, shape=shape)
    return tf.Variable(init_bias_vals)

def conv2d(x, W):
    return tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME')

# x -->[batch, in_height, in_width, in_channels]
# W --> [filter_height, filter_width, in_channels, out_channels]

def max_pool_2by2(x):
    return tf.nn.max_pool(x, ksize=[1, 2, 2, 1],
                          strides=[1, 2, 2, 1], padding='SAME')
def convolutional_layer(input_x, shape):
    W = init_weights(shape)
    b = init_bias([shape[3]])
    return tf.nn.relu(conv2d(input_x, W) + b)

def normal_full_layer(input_layer, size):
    input_size = int(input_layer.get_shape()[1])
    W = init_weights([input_size, size])
    b = init_bias([size])
    return tf.matmul(input_layer, W) + b

搭建神經網絡

第一層，卷積+Pooling

convo_1 = convolutional_layer(x,shape=[6,6,1,32])
convo_1_pooling = max_pool_2by2(convo_1)

第二層

convo_2 = convolutional_layer(convo_1_pooling,shape=[6,6,32,64])
convo_2_pooling = max_pool_2by2(convo_2)

第三層, 全連接配接

convo_2_flat = tf.reshape(convo_2_pooling,[-1,7*7*64])
full_layer_one = tf.nn.relu(normal_full_layer(convo_2_flat,1024))

Dropout 和輸出

full_one_dropout = tf.nn.dropout(full_layer_one,keep_prob=hold_prob)
y_pred = normal_full_layer(full_one_dropout,10)

定義損失函數，和優化函數，初始化

Loss Function

cross_entropy = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y_true,logits=y_pred))

Optimizer

optimizer = tf.train.AdamOptimizer(learning_rate=0.00002)
train = optimizer.minimize(cross_entropy)

Intialize Variables

init = tf.global_variables_initializer()

5、訓練模型

之前的準備工作妥當之後，實際上訓練模型的代碼就很短了。用Tensorflow訓練模型，都必須在一個Session 之内并且初始化(都是套路)。

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())

真正的代碼就這兩行, 實際上就是将之前幫助函數中定義的mini batch 送到模型中進行訓練。

for i in range(50000):
        batch = ch.next_batch(100)
        sess.run(train, feed_dict={x: batch[0], y_true: batch[1], hold_prob: 0.5})

模型要進行50000次的疊代，我們需要将每100此疊代的結果列印出來。完整代碼如下

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())

    for i in range(50000):
        batch = ch.next_batch(100)
        sess.run(train, feed_dict={x: batch[0], y_true: batch[1], hold_prob: 0.5})
        
        # PRINT OUT A MESSAGE EVERY 100 STEPS
        if i%100 == 0:
            
            print('Currently on step {}'.format(i))
            print('Accuracy is:')
            # Test the Train Model
            matches = tf.equal(tf.argmax(y_pred,1),tf.argmax(y_true,1))

            acc = tf.reduce_mean(tf.cast(matches,tf.float32))

            print(sess.run(acc,feed_dict={x:ch.test_images,y_true:ch.test_labels,hold_prob:1.0}))
            print('\n')
        
    saver.save(sess,'models_saving/my_model.ckpt')

最後得到了98%的準确率

Currently on step 0
Accuracy is:
0.179762


Currently on step 100
Accuracy is:
0.584524

....
....
....

Currently on step 49900
Accuracy is:
0.983333

至此，一個完整的用Tensorflow 訓練CNN的過程就介紹完了，當然要最後還需要儲存模型，用模型對新的資料進行預測，關于這部分的内容就留給讀者自己吧。

————

AI學習筆記——神經網絡和深度學習 AI學習筆記——卷積神經網絡1（CNN）

文章首發steemit.com 為了友善牆内閱讀，搬運至此，歡迎留言或者通路

我的Steemit首頁

機器學習實戰篇——用卷積神經網絡算法在Kaggle上跑個分

1、導入庫檔案

2、準備資料

3、建立幫助函數

One Hot Encode

幫助類

4、建立模型

Placeholder

Help functions

搭建神經網絡

定義損失函數，和優化函數，初始化

5、訓練模型

繼續閱讀

學習軟體測試基礎測試第七天

Zeppelin 配置通路 REST APIApache Zeppelin Configuration REST API

【Torch】最簡潔logging使用指南

筆試面試題目：滑動視窗(二)

27. Remove Element(清單)題目代碼

資料結構與算法（27）——排序（二）

無人機--飛控科普

Dijkstra--簡易版（最短路徑）

GitHub連夜封殺！這份阿裡 10W 字内部 Java 字面試手冊到底有多強？

Cloud Studio初體驗

使用 ctypes 進行 Python 和 C 的混合程式設計

【python】【資料處理】畫多元資料分布圖

【python】netconf協定對接管理裝置

「Python 網絡自動化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 網絡裝置

在python中建立excel并寫入

hdu7108哈希