使用opencv截取旋轉框目标

1、第一種方法
2、第二種方法
3、兩種方法的簡單對比
4、opencv 最小面積矩形傳回角度的了解
- 4.1、version4.2之前
- 4.2、version4.2之後

本文列舉了兩種方法，使用的資料如圖,用的是改版rolabelimg标注的

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

标注檔案有四個點的坐标：

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

1、第一種方法

總體思路是，找最小面積矩形，接着旋轉，最後crop

import cv2
import numpy as np
import matplotlib.pyplot as plt


def crop_rect(img, rect):
    # get the parameter of the small rectangle
    center, size, angle = rect[0], rect[1], rect[2]
    center, size = tuple(map(int, center)), tuple(map(int, size))

    # get row and col num in img
    height, width = img.shape[0], img.shape[1]

    # calculate the rotation matrix
    M = cv2.getRotationMatrix2D(center, angle, 1)
    # rotate the original image
    img_rot = cv2.warpAffine(img, M, (width, height))

    # now rotated rectangle becomes vertical, and we crop it
    img_crop = cv2.getRectSubPix(img_rot, size, center)

    return img_crop, img_rot

cnts = []
labels=[]
with open('rodog.txt') as f:
    lines = f.read().strip().splitlines()
for line in lines:
    data = line.split()
    cnt=list(map(int,data[:8]))
    label=data[8]
    cnts.append(cnt)
    labels.append(label)

num=len(cnts)
plt.figure(figsize=(15,num*5))
for i,cnt in enumerate(cnts):
    img = cv2.imread("rodog.jpeg")
    cnt = np.reshape(cnt,[4,2])
    # print("cnt:",cnt)
    # find the exact rectangle enclosing the text area
    # rect is a tuple consisting of 3 elements: the first element is the center
    # of the rectangle, the second element is the width, height, and the
    # third element is the detected rotation angle.
    # Example output: ((227.5, 187.50003051757812),
    # (94.57575225830078, 417.98736572265625), -36.982906341552734)
    rect = cv2.minAreaRect(cnt)
    print("rect: {}".format(rect))
    
    # the order of the box points: bottom left, top left, top right,
    box = cv2.boxPoints(rect)
    box = np.int0(box)
    print("box:",box)
    
    # print("bounding box: {}".format(box))
    cv2.drawContours(img, [box], 0, (255, 0, 0), 2)
    # img_crop will the cropped rectangle, img_rot is the rotated image
    img_crop, img_rot = crop_rect(img, rect)
    plt.subplot(num,3,1+i*3)
    plt.imshow(img)
    plt.title('orig')
    plt.subplot(num,3,2+i*3)
    plt.imshow(img_rot)
    plt.title('rotate')
    plt.subplot(num,3,3+i*3)
    plt.imshow(img_crop)
    plt.title('crop')
    # cv2.imwrite(f"orig_img_{i}.jpg", img)
    # cv2.imwrite(f"rotate_img_{i}.jpg", img_rot)
    # cv2.imwrite(f"cropped_img_{i}.jpg", img_crop)

    # cv2.waitKey(0)

rect: ((370.5, 164.50001525878906), (140.75865173339844, 306.3152160644531), 43.54475784301758)
box: [[213 227]
 [425   5]
 [527 101]
 [316 324]]
rect: ((266.9999694824219, 53.499996185302734), (38.984642028808594, 44.95772933959961), 24.22774314880371)
box: [[239  65]
 [258  25]
 [293  41]
 [275  81]]
rect: ((277.5, 132.0), (533.65625, 161.2079315185547), 11.457330703735352)
box: [[  0 157]
 [ 32   0]
 [555 106]
 [523 264]]

顯示是用的bgr圖，沒轉rgb

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

這種方法目前還沒有遇到問題，如果遇到問題，可能需要在原圖上做pad，相應的标簽檔案也做好修改再進行處理

2、第二種方法

思路是先找最小矩形，接着透視變換得到目标圖。

import cv2
import numpy as np
import matplotlib.pyplot as plt


cnts = []
labels=[]
with open('rodog.txt') as f:
    lines = f.read().strip().splitlines()
for line in lines:
    data = line.split()
    cnt=list(map(int,data[:8]))
    label=data[8]
    cnts.append(cnt)
    labels.append(label)


num=len(cnts)
plt.figure(figsize=(10,num*5))
for i,cnt in enumerate(cnts):
    img = cv2.imread("rodog.jpeg")
    cnt = np.reshape(cnt,[4,2])
    # print("cnt:",cnt)
    # find the exact rectangle enclosing the text area
    # rect is a tuple consisting of 3 elements: the first element is the center
    # of the rectangle, the second element is the width, height, and the
    # third element is the detected rotation angle.
    # Example output: ((227.5, 187.50003051757812),
    # (94.57575225830078, 417.98736572265625), -36.982906341552734)
    rect = cv2.minAreaRect(cnt)
    # print("rect: {}".format(rect))


    # the order of the box points: bottom left, top left, top right,
    # bottom right
    box = cv2.boxPoints(rect)
    box = np.int0(box)
    # print('box:',box)

    # print("bounding box: {}".format(box))
    cv2.drawContours(img, [box], 0, (0, 255, 0), 2)

    # get width and height of the detected rectangle
    width = int(rect[1][0])
    height = int(rect[1][1])
    # print("width,height:",width,height)
    src_pts = box.astype("float32")
    # coordinate of the points in box points after the rectangle has been
    # straightened
    dst_pts = np.array([[0, height-1],
                        [0, 0],
                        [width-1, 0],
                        [width-1, height-1]
                        ], dtype="float32")

    # the perspective transformation matrix
    M = cv2.getPerspectiveTransform(src_pts, dst_pts)

    # directly warp the rotated rectangle to get the straightened rectangle
    warped = cv2.warpPerspective(img, M, (width, height))

    # cv2.imwrite("crop_img.jpg", warped)
    # cv2.waitKey(0)
    plt.subplot(num,2,1+i*2)
    plt.imshow(img)
    plt.title('orig')
    plt.subplot(num,2,2+i*2)
    plt.imshow(warped)
    plt.title('crop')

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

3、兩種方法的簡單對比

img_crop.shape

(161, 533, 3)

warped.shape

(161, 533, 3)

兩種方式看效果是一樣的，圖像尺寸也一樣，但要對比像素值，還是不一樣，如下代碼，絕對誤差大于20有4.56%

try:
    np.testing.assert_allclose(img_crop,warped,atol=20)
except Exception as e:
    print(e)

Not equal to tolerance rtol=1e-07, atol=20

Mismatched elements: 11739 / 257439 (4.56%)
Max absolute difference: 255
Max relative difference: 255.
 x: array([[[255,   0,   0],
        [255,   0,   0],
        [237,  15,  18],...
 y: array([[[  0, 255,   0],
        [  0, 255,   0],
        [  0, 255,   0],...

4、opencv 最小面積矩形傳回角度的了解

center, size, angle=cv2.minAreaRect(points)

points就是一系的（x,y)的點，center,size,angle分别是最小矩形的中心點，寬高，及angle。對于opencv的這個角度的了解，這裡簡單寫一些，實際是與opencv版本有關，以opencv4.2為分界

4.1、version4.2之前

https://blog.csdn.net/weixin_43229348/article/details/125986969 (https://theailearner.com/tag/cv2-minarearect/) 參考這個部落格就好，一句話說，就是x軸逆時針轉，接觸到的第一條邊就是寬邊w(不在意長短）,轉的過程就是角度從0到-90 ，不包括0。

那麼最小面積矩形的四個點及順序是啥，用以下代碼擷取：

rect = CV2.minAreaRect(cnt)
box = cv2.boxPoints(rect)

box就是四個點的坐标，我們會用這四個點做旋轉或透視變換都是需要的。那麼順序是什麼呢？

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

可以了解為x軸與w邊相交，箭頭反方向的點為起點，接着順時針。

4.2、version4.2之後

x軸順時針轉，接觸到的第一條邊就是寬邊w(不在意長短）,轉的過程就是角度從0到90 ，不包括0。x軸也與4.2之前的相反。

點順序如下：

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

可以了解為x軸與w邊相交，箭頭同方向的點為起點，接着順時針。

本文另一篇參考文章：https://jdhao.github.io/2019/02/23/crop_rotated_rectangle_opencv/#fn:2

使用opencv截取旋轉框目标1、第一種方法2、第二種方法3、兩種方法的簡單對比4、opencv 最小面積矩形傳回角度的了解

使用opencv截取旋轉框目标

1、第一種方法

2、第二種方法

3、兩種方法的簡單對比

4、opencv 最小面積矩形傳回角度的了解

4.1、version4.2之前

4.2、version4.2之後

繼續閱讀

無法解析的外部符号 wmain，該符号在函數 "void cdecl mainCRTStartupHelper(struct HINSTANCE *,unsigned short con......

TestLink導出用例轉換工具(XML2Excel)

YAML簡介和PyYAML安全操作YAML支援的類型YAML的優點：yaml的基本文法python操作

cs231n斯坦福基于卷積神經網絡的CV學習筆記（一）KNN和線性分類器/分類器損失/反向傳播一，KNN圖像分類算法二，線性分類器三，線性分類器損失四，反向傳播五，神經網絡

Small tricks

libsvm for python 安裝

學習軟體測試基礎測試第七天

Zeppelin 配置通路 REST APIApache Zeppelin Configuration REST API

【Torch】最簡潔logging使用指南

27. Remove Element(清單)題目代碼

Cloud Studio初體驗

使用 ctypes 進行 Python 和 C 的混合程式設計

【python】【資料處理】畫多元資料分布圖

【python】netconf協定對接管理裝置

「Python 網絡自動化」NETCONF —— Python 使用 NETCONF 管理配置 H3C 網絡裝置

在python中建立excel并寫入