前言

前面五篇文章，介紹了模型搭建、資料準備及pytorch中常用的計算方法等，有了上述基礎後就可以訓練和測試模型了，下面這篇文章會簡單介紹下在pytorch架構下如何測試深度學習模型，以及一些常用代碼。

pytorch checkpoint_深度學習-Pytorch架構學習之模型訓練和測試2前言模型測試Mixup訓練儲存與加載斷點提取預訓練模型某層的卷積特征

模型測試

同樣，以一個簡單的分類模型為例，對應的測試代碼：

# 模型測試model.eval() with torch.no_grad():    correct = 0    total = 0    for images, labels in test_loader:        images = images.to(device)        labels = labels.to(device)        outputs = model(images)        _, predicted = torch.max(outputs.data, 1)        total += labels.size(0)        correct += (predicted == labels).sum().item()            print('在10000張測試照片上測試模型的精度: {} %' .format(100 * correct / total))

Mixup訓練

beta_distribution = torch.distributions.beta.Beta(alpha, alpha)for images, labels in train_loader:    images, labels = images.cuda(), labels.cuda()    # 混合照片和标簽。    lambda_ = beta_distribution.sample([]).item()    index = torch.randperm(images.size(0)).cuda()    mixed_images = lambda_ * images + (1 - lambda_) * images[index, :]    label_a, label_b = labels, labels[index]    # 混合損失。    scores = model(mixed_images)    loss = (lambda_ * loss_function(scores, label_a)            + (1 - lambda_) * loss_function(scores, label_b))    optimizer.zero_grad()    loss.backward()    optimizer.step()

儲存與加載斷點

訓練過程中，可能會出現訓練意外中斷的情況。為了能夠恢複訓練，需要同時儲存模型和優化器的狀态，以及目前的訓練輪數。

start_epoch = 0# 加載checkpoint.if resume: # resume為參數，第一次訓練時設為0，中斷再訓練時設為1    model_path = os.path.join('model', 'best_checkpoint.pth.tar')    assert os.path.isfile(model_path)    checkpoint = torch.load(model_path)    best_acc = checkpoint['best_acc']    start_epoch = checkpoint['epoch']    model.load_state_dict(checkpoint['model'])    optimizer.load_state_dict(checkpoint['optimizer'])    print('Load checkpoint at epoch {}.'.format(start_epoch))    print('Best accuracy so far {}.'.format(best_acc))# 訓練模型for epoch in range(start_epoch, num_epochs):     ...     # 測試模型    ...    # 儲存checkpoint    is_best = current_acc > best_acc    best_acc = max(current_acc, best_acc)    checkpoint = {        'best_acc': best_acc,        'epoch': epoch + 1,        'model': model.state_dict(),        'optimizer': optimizer.state_dict(),    }    model_path = os.path.join('model', 'checkpoint.pth.tar')    best_model_path = os.path.join('model', 'best_checkpoint.pth.tar')    torch.save(checkpoint, model_path)    if is_best:        shutil.copy(model_path, best_model_path)

提取預訓練模型某層的卷積特征

很多模型，往往不會從頭開始訓練，這樣會浪費大量的時間，是以常用的做法是加載其他預訓練的模型。以ImageNet預訓練模型為例，提取其中某一層的卷積特征：

# VGG-16 relu5-3 feature.model = torchvision.models.vgg16(pretrained=True).features[:-1]# VGG-16 pool5 feature.model = torchvision.models.vgg16(pretrained=True).features# VGG-16 fc7 feature.model = torchvision.models.vgg16(pretrained=True)model.classifier = torch.nn.Sequential(*list(model.classifier.children())[:-3])# ResNet GAP feature.model = torchvision.models.resnet18(pretrained=True)model = torch.nn.Sequential(collections.OrderedDict(    list(model.named_children())[:-1]))with torch.no_grad():    model.eval()    conv_representation = model(image)

同理，也可以提取ImageNet 預訓練模型中多層的卷積特征：

class FeatureExtractor(torch.nn.Module):    """Helper class to extract several convolution features from the given    pre-trained model.    Attributes:        _model, torch.nn.Module.        _layers_to_extract, list or set    Example:        >>> model = torchvision.models.resnet152(pretrained=True)        >>> model = torch.nn.Sequential(collections.OrderedDict(                list(model.named_children())[:-1]))        >>> conv_representation = FeatureExtractor(                pretrained_model=model,                layers_to_extract={'layer1', 'layer2', 'layer3', 'layer4'})(image)    """    def __init__(self, pretrained_model, layers_to_extract):        torch.nn.Module.__init__(self)        self._model = pretrained_model        self._model.eval()        self._layers_to_extract = set(layers_to_extract)    def forward(self, x):        with torch.no_grad():            conv_representation = []            for name, layer in self._model.named_children():                x = layer(x)                if name in self._layers_to_extract:                    conv_representation.append(x)            return conv_representation

未完待續...

pytorch checkpoint_深度學習-Pytorch架構學習之模型訓練和測試2前言模型測試Mixup訓練儲存與加載斷點提取預訓練模型某層的卷積特征

pytorch checkpoint_深度學習-Pytorch架構學習之模型訓練和測試2前言模型測試Mixup訓練儲存與加載斷點提取預訓練模型某層的卷積特征

前言

模型測試

Mixup訓練

儲存與加載斷點

提取預訓練模型某層的卷積特征

繼續閱讀

python pth檔案是什麼_嫌python慢？來這裡用pytorch C++前端推理模型

pytorch checkpoint_2018.12.01：使用ONNX轉換PyTorch模型到 Tensorflow *.pb檔案

pytorch resnet50預訓練模型_pytorch中文語言模型bert預訓練代碼

pytorch checkpoint_[日常] PyTorch 預訓練模型，儲存，讀取和更新模型參數以及多 GPU 訓練模型

深度學習模型儲存_解讀計算機視覺的深度學習模型

pytorch forward_PyTorch提取中間層特征？

pytorch checkpoint_記錄一次pytorch加載模型的采坑

pytorch load state dict_Pytorch學習記錄-使用Pytorch進行深度學習，儲存和加載模型

pytorch checkpoint_PyTorch 使用 Horovod 進行分布式訓練

knn pytorch_[PyTorch 學習筆記] 8.1 圖像分類簡述與 ResNet 源碼分析

pytorch load state dict_PyTorch 學習筆記（五）：Finetune和各層定制學習率

pytorch load state dict_學習Pytorch過程遇到的坑（持續更新中）

深度學習模型儲存_TensorFlow 2 模型：深度強化學習

pytorch load state dict_PyTorch使用預訓練模型

pytorch load state dict_pytorch源碼閱讀（二）optimizer原理

parallels網絡初始化失敗_TensorFlow 1.xx加載預訓練模型但Tensor不比對導緻失敗的解決方法...