Training the hidden layer doesn't work

About a month ago I started teaching myself machine learning, especially deep learning, and have been working hard at it. After going through all the math concepts, I decided to write a single-neuron Python program myself, and it works fine (very good accuracy).


Now I decided to do the same thing with a hidden layer of 2 neurons, 1 output neuron, and 2 inputs, but it doesn't work. The cost doesn't decrease and the accuracy doesn't improve, even though the program runs (code below).


import numpy as np
import matplotlib.pyplot as plt


def init_variables():
    """
        Init model variables (weights, bias)
    """
    weights_11 = np.random.normal(size=2)
    weights_12 = np.random.normal(size=2)
    weight_output = np.random.normal(size=2)
    bias_11 = 0
    bias_12 = 0
    bias_output = 0
    return weights_11, weights_12, weight_output, bias_11, bias_12, bias_output


def get_dataset():
    """
        Generate the dataset
    """
    # Number of rows per class
    row_per_class = 100
    # Generate rows (an XOR-like layout of four Gaussian clusters)
    sick_people = np.random.randn(row_per_class, 2) + np.array([-2, -2])
    sick_people2 = np.random.randn(row_per_class, 2) + np.array([2, 2])
    healthy_people = np.random.randn(row_per_class, 2) + np.array([-2, 2])
    healthy_people2 = np.random.randn(row_per_class, 2) + np.array([2, -2])

    features = np.vstack([sick_people, sick_people2, healthy_people, healthy_people2])
    targets = np.concatenate((np.zeros(row_per_class * 2), np.ones(row_per_class * 2)))

    #plt.scatter(features[:, 0], features[:, 1], c=targets, cmap=plt.cm.Spectral)
    #plt.show()

    return features, targets


def pre_activation(features, weights, bias):
    """
        Compute the pre-activation of the neuron
    """
    return np.dot(features, weights) + bias


def activation(z):
    """
        Compute the activation (sigmoid)
    """
    return 1 / (1 + np.exp(-z))


def derivative_activation(z):
    """
        Compute the derivative of the sigmoid activation
    """
    return activation(z) * (1 - activation(z))


def cost(predictions, targets):
    """
        Mean squared error between predictions and targets
    """
    return np.mean((predictions - targets) ** 2)



The code isn't efficient, because I'm trying to understand everything step by step. I know the problem is in training the hidden layer, but it follows the formula I've seen on the internet (neuron_input * (prediction - target) * sigmoid'(prediction) * weight_of_the_next_layer), which is why I really don't understand.
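The question doesn't show the training loop itself, so as a point of comparison, here is a minimal sketch of how backpropagation through a 2-neuron hidden layer could look for this setup (2 inputs, 2 hidden sigmoid neurons, 1 sigmoid output, MSE cost). All names here (`train`, `w_hidden`, `w_out`, etc.) are my own illustration, not the asker's actual code; the key point is that the hidden-layer delta multiplies the output delta by the outgoing weight and by sigmoid' of the hidden *pre-activation*:

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def sigmoid_prime(z):
    # derivative of sigmoid w.r.t. the pre-activation z
    s = sigmoid(z)
    return s * (1 - s)

def train(features, targets, epochs=2000, lr=0.5):
    rng = np.random.default_rng(0)
    # Hidden layer: 2 neurons, each with 2 input weights; output neuron: 2 weights
    w_hidden = rng.normal(size=(2, 2))   # row i = weights of hidden neuron i
    b_hidden = np.zeros(2)
    w_out = rng.normal(size=2)
    b_out = 0.0
    costs = []
    for _ in range(epochs):
        # Forward pass
        z_hidden = features @ w_hidden.T + b_hidden   # (N, 2) pre-activations
        a_hidden = sigmoid(z_hidden)                  # (N, 2) hidden activations
        z_out = a_hidden @ w_out + b_out              # (N,)  output pre-activation
        predictions = sigmoid(z_out)
        costs.append(np.mean((predictions - targets) ** 2))
        # Backward pass (MSE cost)
        d_out = (predictions - targets) * sigmoid_prime(z_out)                 # (N,)
        d_hidden = d_out[:, None] * w_out[None, :] * sigmoid_prime(z_hidden)   # (N, 2)
        # Gradient steps, averaged over the batch
        w_out -= lr * (a_hidden * d_out[:, None]).mean(axis=0)
        b_out -= lr * d_out.mean()
        w_hidden -= lr * (d_hidden[:, :, None] * features[:, None, :]).mean(axis=0)
        b_hidden -= lr * d_hidden.mean(axis=0)
    return predictions, costs
```

A common pitfall this sketch avoids: sigmoid' must be evaluated on the stored pre-activation (or equivalently computed as a*(1-a) from the stored activation), never by passing an already-activated value back through sigmoid again.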


Asked by 烙印99 · 125 views

1 Answer

慕桂英3389331

There may be a bug in your derivative function:

def derivative_activation(z):
    """
        compute the derivative of the activation (derivative of sigmoid)
    """
    return activation(z) * (1 - activation(z))

Suppose out_F = sigmoid(in_F) at the final output layer, where out_F is your prediction and in_F is the input to that last node. As the function name suggests, this function is probably meant to compute the derivative with respect to in_F, which is d{out_F}/d{in_F} = out_F * (1 - out_F). So if what you pass in is already the activation out_F, try this instead:

def derivative_activation(z):
    """
        compute the derivative of the activation (sigmoid),
        where z is already the sigmoid output
    """
    return z * (1 - z)
