究竟是什么标识了网站访问者？

Question

Alexander Pozharskii

Asked:2020-01-14 04:27:05 +0000 UTC2020-01-14 04:27:05 +0000 UTC 2020-01-14 04:27:05 +0000 UTC

如何优化在 theano 上使用池化/非池化索引？

772

实际上，任务是尽可能准确地复制theano 上 SpatialMaxPooling 和 SpatialMaxUnpooling 层的行为。

在这种情况下，SpatialMaxUnpooling 只填充对应于相应 SpatialMaxPooling 中最大值索引的那些“单元格”。

例如 - 这是输入图像

SpatialMaxPooling 将存储每个 2x2 区域中具有最大值的像素及其索引。

而 SpatialMaxUnpooling - 只会将值设置为与索引对应的那些像素。也就是说，输出将是

我发布了以下实现：

def pooling2d_2x2(self, x):
    reshaped = x.reshape([
        x.shape[0], x.shape[1], x.shape[2] // 2, 2, x.shape[3] // 2, 2
    ])
    max_values, max_indices = T.max_and_argmax(reshaped, (3,5,))
    return max_values, max_indices

def unpooling2d_2x2(self, pooled, indices):
    tmp_shape = [pooled.shape[0], pooled.shape[1], pooled.shape[2], 2, pooled.shape[3], 2]
    # Resize image
    resized = pooled.repeat(2, 2).repeat(2, 3)
    pooled_reshaped = resized.reshape(tmp_shape)
    # Resize indices
    indices_repeaten = indices.repeat(2, 2).repeat(2, 3).reshape(tmp_shape)
    # Calculate output
    result = pooled_reshaped * 0.0
    result = T.set_subtensor(result[:, :, :, 0, :, 0],
                             pooled_reshaped[:, :, :, 0, :, 0] * T.eq(indices_repeaten[:, :, :, 0, :, 0], 0))
    result = T.set_subtensor(result[:, :, :, 0, :, 1],
                             pooled_reshaped[:, :, :, 0, :, 1] * T.eq(indices_repeaten[:, :, :, 0, :, 1], 1))
    result = T.set_subtensor(result[:, :, :, 1, :, 0],
                             pooled_reshaped[:, :, :, 1, :, 0] * T.eq(indices_repeaten[:, :, :, 1, :, 0], 2))
    result = T.set_subtensor(result[:, :, :, 1, :, 1],
                             pooled_reshaped[:, :, :, 1, :, 1] * T.eq(indices_repeaten[:, :, :, 1, :, 1], 3))
    result_shape = [pooled.shape[0], pooled.shape[1], pooled.shape[2] * 2, pooled.shape[3] * 2]
    return result.reshape(result_shape)

但她在速度上并不出色（顺便说一句，我不会拒绝建议 - 如何配置文件）。因此问题 - 这里可以改进什么？

1 个回答

Voted

Alexander Pozharskii · Answer 1 · 2020-01-14T05:22:49Z

下一个替换（据我了解 theano（这可能是一个非常平庸的理解 :-)）——这里我们不再为新张量分配内存，仅指“增加的”输入张量和索引）稍微增加了速度。但是，也许 - 还有其他可能的改进吗？

def unpooling2d_2x2(self, pooled, indices):
    tmp_shape = [pooled.shape[0], pooled.shape[1], pooled.shape[2], 2, pooled.shape[3], 2]
    # Resize image
    resized = pooled.repeat(2, 2).repeat(2, 3)
    pooled_reshaped = resized.reshape(tmp_shape)
    # Resize indices
    indices_repeaten = indices.repeat(2, 2).repeat(2, 3).reshape(tmp_shape)
    # Calculate output
    result = pooled_reshaped * 0.0
    # Calculate output
    result = T.set_subtensor(pooled_reshaped[:, :, :, 0, :, 0],
                             pooled_reshaped[:, :, :, 0, :, 0] * T.eq(indices_repeaten[:, :, :, 0, :, 0], 0))
    result = T.set_subtensor(result[:, :, :, 0, :, 1],
                             pooled_reshaped[:, :, :, 0, :, 1] * T.eq(indices_repeaten[:, :, :, 0, :, 1], 1))
    result = T.set_subtensor(result[:, :, :, 1, :, 0],
                             pooled_reshaped[:, :, :, 1, :, 0] * T.eq(indices_repeaten[:, :, :, 1, :, 0], 2))
    result = T.set_subtensor(result[:, :, :, 1, :, 1],
                             pooled_reshaped[:, :, :, 1, :, 1] * T.eq(indices_repeaten[:, :, :, 1, :, 1], 3))
    result_shape = [pooled.shape[0], pooled.shape[1], pooled.shape[2] * 2, pooled.shape[3] * 2]

如何优化在 theano 上使用池化/非池化索引？

Python 3.6 - 安装 MySQL (Windows)

C++ 编写程序“计算单个岛屿”。填充一个二维数组 12x12 0 和 1

返回指针的函数

我使用 django 管理面板添加图像，但它没有显示

这些条目是什么意思，它们的完整等效项是什么样的

浏览器仍然缓存文件数据

在 Excel VBA 中激活工作表的问题

为什么内置类型中包含复数而小数不包含？

获得唯一途径

告诉我一个像幻灯片一样创建滚动的库

如何优化在 theano 上使用池化/非池化索引？

1 个回答

相关问题