ANN 错误地构建正弦曲线

Question

Тима

Asked:2023-10-22 08:18:25 +0800 CST2023-10-22 08:18:25 +0800 CST 2023-10-22 08:18:25 +0800 CST

向数据添加噪声

772

我为自己编写了一个小神经元来识别 MNIST 数据库中的数字（可以说是从头开始）。它可以很好地处理现成的数据，但是当我想识别我的手写数字时，我会遇到错误 -他们在这里解释了原因。我想问，我应该将图片乘以什么矩阵才能获得数字移位或旋转？

1 个回答

Voted

Тима · Answer 1 · 2023-10-23T08:08:36+08:00

总的来说，当问题在撒谎时，我自己找到了答案。这是代码

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from scipy import ndimage
from PIL import ImageEnhance, Image

data = pd.read_csv('....digit-recognizer\train.csv')

data = np.array(data)
image = data[1][1:]
    

def add_some_noise(image):
    
    image.shape = (28,28)
    
    image = image / 1
    
    temp_file = Image.fromarray(image).convert('L')
    temp_size = np.random.randint(10,35)
    temp_file = temp_file.resize((temp_size,temp_size))
    
    image = np.array(temp_file.crop((temp_file.size[0] / 2 - 14, temp_file.size[1] / 2 - 14, temp_file.size[1] / 2 + 14, temp_file.size[1] / 2 + 14)))
    
    
    temp_file = ndimage.rotate(image, random.randint(-20, 20), reshape=False, prefilter=False) / 1
    temp_file = Image.fromarray(temp_file).convert("L")
    
    enhancer = ImageEnhance.Sharpness(temp_file)
    factor = np.random.random(1)[0] * 10
    temp_file = enhancer.enhance(factor)
    image = np.array(temp_file)
    
    image = np.roll(image, random.randint(-4,4), axis = 1)
    image = np.roll(image, random.randint(-2,2), axis = 0)
    
    pixels = [250, 150,  100, 50 , 0]
    noise = np.random.choice(pixels, 784,  p=[0.003, 0.005,  0.007,  0.015, 0.97])

    noise.shape = (28,28)

    image = image + (1 * image < 50) * noise
    
    image.shape = (1,784)
    
    return image[0]

image = image / 1

由于某种原因， fromarray 方法适用于 float，但不适用于 int

    temp_file = Image.fromarray(image).convert('L')
    temp_size = np.random.randint(10,35)
    temp_file = temp_file.resize((temp_size,temp_size))
    
    image = np.array(temp_file.crop((temp_file.size[0] / 2 - 14, temp_file.size[1] / 2 - 14, temp_file.size[1] / 2 + 14, temp_file.size[1] / 2 + 14)))

我随机调整图像大小，然后制作 28x28 的剪切图

temp_file = scipy.ndimage.rotate(image, random.randint(-20, 20), reshape=False, prefilter=False) / 1

旋转图像。除以 1 以匹配数据类型。（返回图像）

temp_file = Image.fromarray(temp_file).convert("L")

来自图像的数组

factor = np.random.random(1)[0] * 10
temp_file = enhancer.enhance(factor)
image = np.array(temp_file)

改变清晰度或模糊度

image = np.roll(image, random.randint(-2,2), axis = 0)

轴移

pixels = [250, 150,  100, 50 , 0]
noise = np.random.choice(pixels, 784,  p=[0.003, 0.005,  0.007,  0.015, 0.97])

noise.shape = (28,28)

image = image + (1 * image < 100) * noise

添加噪音

(1 * image < 100) * noise

正是在那些没有描述数字本身的地方

结果

总的来说，有很多可用的 python 工具是非常酷的）

向数据添加噪声

我看不懂措辞

请求的模块“del”不提供名为“default”的导出

"!+tab" 在 HTML 的 vs 代码中不起作用

我正在尝试解决“猜词”的问题。Python

可以使用哪些命令将当前指针移动到指定的提交而不更改工作目录中的文件？

Python解析野莓

问题：“警告：检查最新版本的 pip 时出错。”

帮助编写一个用值填充变量的循环。解决这个问题

尽管依赖数组为空，但在渲染上调用了 2 次 useEffect

数据不通过 Telegram.WebApp.sendData 发送

向数据添加噪声

1 个回答

相关问题