detection_output

paddle.fluid.layers.detection_output(loc, scores, prior_box, prior_box_var, background_label=0, nms_threshold=0.3, nms_top_k=400, keep_top_k=200, score_threshold=0.01, nms_eta=1.0)

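Computes the detection output from the regression location offsets, the confidence scores, and the prior boxes, in two steps:

1. Decode the predicted box coordinates from the prior boxes (prior_box) and the regression location offsets.

2. Apply multi-class non-maximum suppression (NMS) to obtain the final detection output.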

Note that this operator does not clip the final output bounding boxes to the image size.
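The decoding step and the missing clipping can be sketched in NumPy as below. This is a minimal illustration, not the operator's implementation: it assumes the SSD-style center-size encoding, in which the four values in loc are read as (dx, dy, dw, dh) offsets scaled by prior_box_var, and it assumes normalized coordinates. The helper names decode_center_size and clip_boxes are hypothetical.

```python
import numpy as np


def decode_center_size(loc, prior_box, prior_box_var):
    """Decode offsets of shape [N, M, 4] against priors of shape [M, 4].

    Assumption: center-size encoding with normalized coordinates.
    """
    # Prior box width, height, and center.
    pw = prior_box[:, 2] - prior_box[:, 0]
    ph = prior_box[:, 3] - prior_box[:, 1]
    pcx = prior_box[:, 0] + 0.5 * pw
    pcy = prior_box[:, 1] + 0.5 * ph

    # Apply the variance-scaled offsets; [M] broadcasts against [N, M].
    cx = prior_box_var[:, 0] * loc[..., 0] * pw + pcx
    cy = prior_box_var[:, 1] * loc[..., 1] * ph + pcy
    w = np.exp(prior_box_var[:, 2] * loc[..., 2]) * pw
    h = np.exp(prior_box_var[:, 3] * loc[..., 3]) * ph

    # Convert back to [xmin, ymin, xmax, ymax].
    return np.stack(
        [cx - 0.5 * w, cy - 0.5 * h, cx + 0.5 * w, cy + 0.5 * h], axis=-1
    )


def clip_boxes(boxes, width=1.0, height=1.0):
    """Clip [xmin, ymin, xmax, ymax] boxes to the image extent."""
    out = boxes.copy()
    out[..., 0::2] = np.clip(out[..., 0::2], 0.0, width)   # x coordinates
    out[..., 1::2] = np.clip(out[..., 1::2], 0.0, height)  # y coordinates
    return out
```

Because the operator itself does not clip, a caller that needs boxes inside the image would apply something like clip_boxes to the coordinate columns of the output.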

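Parameters:

loc (Variable) - A 3-D Tensor of dtype float32 or float64 holding the regression location offsets. Its shape is [N,M,4], where N is the batch size and M is the number of predicted bounding boxes per image. Each bounding box has four coordinate values in the format [xmin,ymin,xmax,ymax], where [xmin,ymin] is the top-left corner and [xmax,ymax] is the bottom-right corner.
scores (Variable) - A 3-D Tensor of dtype float32 or float64 holding the unnormalized confidence scores. Its shape is [N,M,C], where N and M are as above and C is the number of classes.
prior_box (Variable) - A 2-D Tensor holding the prior boxes. Its shape is [M,4], where M is the number of prior boxes, each in the format [xmin,ymin,xmax,ymax].
prior_box_var (Variable) - A 2-D Tensor holding the variances of the prior boxes, with the same shape as prior_box.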
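background_label (int) - The label of the background class; NMS is not performed on this class. If set to -1, all classes are considered. Default: 0.
nms_threshold (float) - The overlap threshold used by NMS. Default: 0.3.
nms_top_k (int) - The maximum number of highest-confidence predicted boxes to keep after filtering by score_threshold and before NMS. Default: 400.
keep_top_k (int) - The total number of bounding boxes to keep after NMS. Default: 200.
score_threshold (float) - The confidence score threshold used to filter out low-confidence bounding boxes before NMS. If not provided, all boxes are considered. Default: 0.01.
nms_eta (float) - A parameter for adaptive NMS; it takes effect only when its value is less than 1.0. Default: 1.0.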
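How score_threshold, nms_top_k, nms_threshold, and nms_eta interact for a single class can be sketched as follows. This is a hedged NumPy illustration of greedy NMS, not the operator's code: the function names are hypothetical, and the adaptive rule (multiply the threshold by nms_eta after each kept box while it is above 0.5) is an assumption based on the common SSD-style implementation.

```python
import numpy as np


def iou(box, boxes):
    """IoU between one box [4] and a set of boxes [K, 4]."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    # Guard against a zero union for degenerate boxes.
    return inter / np.maximum(area + areas - inter, 1e-12)


def nms_single_class(boxes, scores, score_threshold=0.01, nms_top_k=400,
                     nms_threshold=0.3, nms_eta=1.0):
    """Return indices of the boxes kept for one class, best score first."""
    # 1. Drop low-confidence boxes.
    candidates = np.flatnonzero(scores > score_threshold)
    # 2. Keep at most nms_top_k candidates, sorted by descending score.
    order = candidates[np.argsort(-scores[candidates], kind="stable")]
    order = order[:nms_top_k]

    keep = []
    threshold = nms_threshold
    for idx in order:
        # 3. Keep a box only if it overlaps no kept box beyond the threshold.
        if not keep or iou(boxes[idx], boxes[keep]).max() <= threshold:
            keep.append(int(idx))
            # 4. Adaptive NMS (assumed rule): shrink the threshold over time.
            if nms_eta < 1.0 and threshold > 0.5:
                threshold *= nms_eta
    return keep
```

In a multi-class setting this routine would run once per non-background class, after which only the keep_top_k highest-scoring detections across all classes are retained for each image.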

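The output is a 2-D LoDTensor of shape [No,6]. Each row has 6 values: [label,confidence,xmin,ymin,xmax,ymax]. No is the total number of detections in the mini-batch. The LoD has one level. With the offset-based LoD representation, the i-th image has LoD[i+1] - LoD[i] detections; if this difference is 0, the image has no detections.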
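The offset-based LoD can be read as a list of row boundaries into the [No,6] output. The sketch below assumes the detections and the level-1 offsets have already been copied into a NumPy array and a plain Python list; how they are fetched from the LoDTensor is not shown, and split_by_lod is a hypothetical helper.

```python
import numpy as np


def split_by_lod(detections, lod_offsets):
    """Split a [No, 6] detection array into one array per image.

    lod_offsets has batch_size + 1 entries; image i owns rows
    lod_offsets[i] up to (but not including) lod_offsets[i + 1].
    """
    return [
        detections[start:end]
        for start, end in zip(lod_offsets[:-1], lod_offsets[1:])
    ]


# Example: 5 detections over 3 images; the second image has none.
detections = np.arange(30, dtype=np.float32).reshape(5, 6)
per_image = split_by_lod(detections, [0, 2, 2, 5])
counts = [len(d) for d in per_image]  # [2, 0, 3]
```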

Return type: Variable

Code example:

import paddle.fluid as fluid

# M = 21 prior boxes, matching the M dimension of loc and scores.
pb = fluid.data(name='prior_box', shape=[21, 4], dtype='float32')
pbv = fluid.data(name='prior_box_var', shape=[21, 4], dtype='float32')
# N = 2 images, M = 21 boxes per image, C = 10 classes.
loc = fluid.data(name='target_box', shape=[2, 21, 4], dtype='float32')
scores = fluid.data(name='scores', shape=[2, 21, 10], dtype='float32')
# Decode the boxes and run multi-class NMS with the default thresholds.
nmsed_outs = fluid.layers.detection_output(scores=scores,
                                           loc=loc,
                                           prior_box=pb,
                                           prior_box_var=pbv)