PyTorch教程14.4之锚箱-电子发烧友网

物体检测算法通常在输入图像中采样大量区域，判断这些区域是否包含感兴趣的物体，并调整区域的边界，从而更准确地预测物体的真实边界框。不同的模型可能采用不同的区域采样方案。在这里，我们介绍其中一种方法：它生成多个以每个像素为中心的具有不同比例和纵横比的边界框。这些边界框称为锚框。我们将在14.7 节设计一个基于锚框的目标检测模型。

首先，让我们修改打印精度以获得更简洁的输出。

						%matplotlib inline
import torch
from d2l import torch as d2l

torch.set_printoptions(2) # Simplify printing accuracy

						%matplotlib inline
from mxnet import gluon, image, np, npx
from d2l import mxnet as d2l

np.set_printoptions(2) # Simplify printing accuracy
npx.set_np()

						 

14.4.1。生成多个锚框

假设输入图像的高度为h和宽度 w. 我们以图像的每个像素为中心生成具有不同形状的锚框。让规模成为s∈(0,1]纵横比（宽高比）为 r>0. 那么anchor box的宽高分别是hsr和 hs/r，分别。请注意，当中心位置给定时，将确定一个已知宽度和高度的锚框。

为了生成多个不同形状的锚框，让我们设置一系列尺度s1,…,sn和一系列纵横比 r1,…,rm. 当以每个像素为中心使用这些尺度和纵横比的所有组合时，输入图像将总共有whnm锚箱。虽然这些anchor boxes可能会覆盖所有的ground-truth bounding boxes，但是计算复杂度很容易过高。在实践中，我们只能考虑那些包含s1或者r1:

(14.4.1)(s1,r1),(s1,r2),…,(s1,rm),(s2,r1),(s3,r1),…,(sn,r1).

也就是说，以同一个像素为中心的anchor boxes的个数为 n+m−1. 对于整个输入图像，我们将生成总共 wh(n+m−1)锚箱。

上面生成anchor boxes的方法是在下面的multibox_prior函数中实现的。我们指定输入图像、比例列表和纵横比列表，然后此函数将返回所有锚框。

							#@save
def multibox_prior(data, sizes, ratios):
  """Generate anchor boxes with different shapes centered on each pixel."""
  in_height, in_width = data.shape[-2:]
  device, num_sizes, num_ratios = data.device, len(sizes), len(ratios)
  boxes_per_pixel = (num_sizes + num_ratios - 1)
  size_tensor = torch.tensor(sizes, device=device)
  ratio_tensor = torch.tensor(ratios, device=device)
  # Offsets are required to move the anchor to the center of a pixel. Since
  # a pixel has height=1 and width=1, we choose to offset our centers by 0.5
  offset_h, offset_w = 0.5, 0.5
  steps_h = 1.0 / in_height # Scaled steps in y axis
  steps_w = 1.0 / in_width # Scaled steps in x axis

  # Generate all center points for the anchor boxes
  center_h = (torch.arange(in_height, device=device) + offset_h) * steps_h
  center_w = (torch.arange(in_width, device=device) + offset_w) * steps_w
  shift_y, shift_x = torch.meshgrid(center_h, center_w, indexing='ij')
  shift_y, shift_x = shift_y.reshape(-1), shift_x.reshape(-1)

  # Generate `boxes_per_pixel` number of heights and widths that are later
  # used to create anchor box corner coordinates (xmin, xmax, ymin, ymax)
  w = torch.cat((size_tensor * torch.sqrt(ratio_tensor[0]),
          sizes[0] * torch.sqrt(ratio_tensor[1:])))\
          * in_height / in_width # Handle rectangular inputs
  h = torch.cat((size_tensor / torch.sqrt(ratio_tensor[0]),
          sizes[0] / torch.sqrt(ratio_tensor[1:])))
  # Divide by 2 to get half height and half width
  anchor_manipulations = torch.stack((-w, -h, w, h)).T.repeat(
                    in_height * in_width, 1) / 2

  # Each center point will have `boxes_per_pixel` number of anchor boxes, so
  # generate a grid of all anchor box centers with `boxes_per_pixel` repeats
  out_grid = torch.stack([shift_x, shift_y, shift_x, shift_y],
        dim=1).repeat_interleave(boxes_per_pixel, dim=0)
  output = out_grid + anchor_manipulations
  return output.unsqueeze(0)

							 

							#@save
def multibox_prior(data, sizes, ratios):
  """Generate anchor boxes with different shapes centered on each pixel."""
  in_height, in_width = data.shape[-2:]
  device, num_sizes, num_ratios = data.ctx, len(sizes), len(ratios)
  boxes_per_pixel = (num_sizes + num_ratios - 1)
  size_tensor = np.array(sizes, ctx=device)
  ratio_tensor = np.array(ratios, ctx=device)
  # Offsets are required to move the anchor to the center of a pixel. Since
  # a pixel has height=1 and width=1, we choose to offset our centers by 0.5
  offset_h, offset_w = 0.5, 0.5
  steps_h = 1.0 / in_height # Scaled steps in y-axis
  steps_w = 1.0 / in_width # Scaled steps in x-axis

  # Generate all center points for the anchor boxes
  center_h = (np.arange(in_height, ctx=device) + offset_h) * steps_h
  center_w = (np.arange(in_width, ctx=device) + offset_w) * steps_w
  shift_x, shift_y = np.meshgrid(center_w, center_h)
  shift_x, shift_y = shift_x.reshape(-1), shift_y.reshape(-1)

  # Generate `boxes_per_pixel` number of heights and widths that are later
  # used to create anchor box corner coordinates (xmin, xmax, ymin, ymax)
  w = np.concatenate((size_tensor * np.sqrt(ratio_tensor[0]),
            sizes[0] * np.sqrt(ratio_tensor[1:]))) \
            * 
						

PyTorch教程14.4之锚箱

14.4.1。生成多个锚框

PyTorch教程21.3之矩阵分解

PyTorch教程22.6之随机变量

PyTorch教程23.4之使用Google Colab

PyTorch教程23.2之使用亚马逊SageMaker

PyTorch教程23.8之API

PyTorch教程4.1之Softmax回归

PyTorch教程3.6之概括

PyTorch教程4.7之环境与分配转变

PyTorch教程6.2之参数管理

PyTorch教程6.1之层和模块

PyTorch教程10.8之波束搜索

PyTorch教程12.1之优化和深度学习

PyTorch教程12.2之凸度

PyTorch教程13.4之硬件

PyTorch教程13.3之自动并行

PyTorch教程13.2之异步计算

PyTorch教程14.2之微调

PyTorch教程14.1之图像增强

PyTorch教程6.7之显卡

PyTorch教程2.5之自动微分

PyTorch教程2.3之线性代数

PyTorch教程3.1之线性回归

PyTorch教程2.6之概率统计

PyTorch教程14.11之全卷积网络

PyTorch教程14.10之转置卷积

PyTorch教程19.3之异步随机搜索

PyTorch教程21.1之推荐系统概述

PyTorch教程7.3之填充和步幅

PyTorch教程7.2之图像卷积

PyTorch教程8.2之使用块的网络(VGG)

振弦式锚杆应力计的工作原理与数据计算方法

pytorch怎么在pycharm中运行

PyTorch的介绍与使用案例

tensorflow和pytorch哪个更简单?

如何使用PyTorch建立网络模型

恒温恒湿试验箱：科技之翼，质量之锚

HT for Web (Hightopo) 使用心得（3）- 吸附与锚点

基于PyTorch AMD的解决方案

使用PyTorch加速图像分割

深度学习框架pytorch介绍

深度学习框架pytorch入门与实践

防爆试验箱之原理说明

PyTorch教程-14.4. 锚箱

PyTorch 的 Autograd 机制和使用

案例分享： OFDR用于锚杆应力测试

基于PyTorch的深度学习入门教程之PyTorch的自动梯度计算

基于PyTorch的深度学习入门教程之PyTorch简单知识

基于PyTorch的深度学习入门教程之PyTorch重点综合实践

基于PyTorch的深度学习入门教程之使用PyTorch构建一个神经网络

苹果iOS/iPadOS 14.4正式版终于发布

苹果iOS14.4和iPadOS14.4正式版发布

苹果重磅推送iOS 14.4系统更新

苹果面向开发人员推送iOS 14.4 RC版更新页面

苹果重磅发布iOS 14.4标准正式版

苹果发布iOS14.4与 iPadOS14.4 RC版更新

锚杆内外径及螺距如何在线检测

苹果iOS14.4 Beta更新了什么内容?

苹果iOS 14.4/iPadOS 14.4公测版beta1发布

预应力锚具之锚具连接器的知识

一文解构PyTorch：深入了解PyTorch内部机制

下载排行榜

爱华AIWA HS-J202维修手册

PC5502负载均流控制电路数据手册

飞利浦D8714收录机说明书

H110主板CPU PWM芯片ISL95858HRZ-T核心供电电路图资料

⼯业电源&模块电源产品⼿册

UWB653Pro USB口测距通信定位模块规格书