
Backpropagation (Andrew Ng's Coursera ML): gradient descent clarification

backpropagation machine-learning

mon · Tech Community · 4 years ago

% Calculate the gradients of Weight2
% Derivative of the loss function J = L(Z):       dJ/dZ = (oi - yi) / (oi (1 - oi))
% Derivative of the sigmoid activation function:  dZ/dY = oi (1 - oi)

delta_theta2 = oi - yi;  % <--- (dJ/dZ) * (dZ/dY)

% Using + (plus), NOT - (minus)
Theta2_grad = Theta2_grad + ...   % <-------- Why plus (+)?
              bsxfun(@times, hi, transpose(delta_theta2));
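
The product in the comments above can be checked term by term. This is a sketch that assumes the usual binary cross-entropy loss per output unit and a sigmoid output; the symbols $o_i$ (output), $y_i$ (label) and $z_i$ (input to the sigmoid) are chosen here for readability, so $o_i$ plays the role of Z and $z_i$ the role of Y in the comments.

```latex
\begin{aligned}
J &= -\bigl[\, y_i \log o_i + (1 - y_i)\log(1 - o_i) \,\bigr], \qquad o_i = \frac{1}{1 + e^{-z_i}} \\[4pt]
\frac{\partial J}{\partial o_i} &= -\frac{y_i}{o_i} + \frac{1 - y_i}{1 - o_i} = \frac{o_i - y_i}{o_i\,(1 - o_i)} \\[4pt]
\frac{\partial o_i}{\partial z_i} &= o_i\,(1 - o_i) \\[4pt]
\frac{\partial J}{\partial z_i} &= \frac{\partial J}{\partial o_i}\cdot\frac{\partial o_i}{\partial z_i} = o_i - y_i
\end{aligned}
```

The factor $o_i(1 - o_i)$ cancels, which is why `delta_theta2` is simply `oi - yi`.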

Code excerpt

for i = 1:m  
    % i is the training-example index into X (including bias). X(i, :) has 401 values.
    xi = X(i, :);
    yi = Y(i, :);
    
    % hi is the i-th output of the hidden layer. H(i, :) has 26 values.
    hi = H(i, :);
    
    % oi is the i-th output of the output layer. O(i, :) has 10 values.
    oi = O(i, :);
    
    %------------------------------------------------------------------------
    % Calculate the gradients of Theta2
    %------------------------------------------------------------------------
    delta_theta2 = oi - yi;
    Theta2_grad = Theta2_grad + bsxfun(@times, hi, transpose(delta_theta2));
 
    %------------------------------------------------------------------------
    % Calculate the gradients of Theta1
    %------------------------------------------------------------------------
    % Derivative of g(z): g'(z)=g(z)(1-g(z)) where g(z) is sigmoid(H_NET).
    dgz = (hi .* (1 - hi));
    delta_theta1 = dgz .* sum(bsxfun(@times, Theta2, transpose(delta_theta2)));
    % There is no input into H0, hence there is no theta for H0. Remove H0.
    delta_theta1 = delta_theta1(2:end);
    Theta1_grad = Theta1_grad + bsxfun(@times, xi, transpose(delta_theta1));
end

I thought the derivative was supposed to be subtracted.

Derivative of Binary Cross Entropy - why are my signs not right?

1 answer | 4 years ago

ntlarry · 4 years ago

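Since the gradient is computed by averaging the gradients of all training examples, we first "accumulate" the gradient while looping over the training examples. We do this by summing the per-example gradients. So the line you highlighted with the plus sign is not the gradient update step. (Notice that alpha isn't there either.) The update is probably somewhere else, most likely outside the loop from 1 to m.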
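
To make the distinction concrete, here is a minimal Octave sketch of the three steps: accumulate with plus inside the loop, average, then update with minus outside the loop. The toy data, the sizes, the learning rate `alpha` and the `cost` helper are all made up for illustration and are not taken from the assignment; only the output layer is shown.

```octave
% Toy data (hypothetical): m = 4 examples, 3 hidden activations
% (bias first), 2 output units.
H = [1 0.2 0.7; 1 0.9 0.1; 1 0.5 0.5; 1 0.3 0.8];
Y = [1 0; 0 1; 1 0; 0 1];
Theta2 = [0.1 -0.2 0.3; -0.1 0.4 0.2];
alpha = 0.5;                       % learning rate (assumed value)
m = size(H, 1);

sigmoid = @(z) 1 ./ (1 + exp(-z));
% Unregularized cross-entropy cost as a function of Theta2 (helper for this sketch).
cost = @(T) -sum(sum(Y .* log(sigmoid(H * T')) + ...
                     (1 - Y) .* log(1 - sigmoid(H * T')))) / m;

O = sigmoid(H * Theta2');          % forward pass, m x 2
Theta2_grad = zeros(size(Theta2));

% Step 1: accumulate (sum) the per-example gradients. This is the "+" line.
for i = 1:m
    hi = H(i, :);
    delta_theta2 = O(i, :) - Y(i, :);
    Theta2_grad = Theta2_grad + bsxfun(@times, hi, transpose(delta_theta2));
end

% Step 2: turn the sum into an average.
Theta2_grad = Theta2_grad / m;

% Step 3: the gradient descent update, outside the loop. This is the "-" line.
cost_before = cost(Theta2);
Theta2 = Theta2 - alpha * Theta2_grad;
cost_after = cost(Theta2);

fprintf('cost before: %.4f, cost after: %.4f\n', cost_before, cost_after);
```

The sign in step 1 only says "add this example's contribution to the total"; the descent direction comes from the minus in step 3.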

Also, I'm not sure when you'll get to this (I'm sure it's somewhere in the course), but you can also vectorize the code :)
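
One possible vectorized form, as a sketch on a small made-up network (the sizes, the data and the `*_vec` names are illustrative assumptions, and regularization is left out). It replaces the loop with matrix products and then checks the result against the loop from the question.

```octave
% Toy network (hypothetical sizes): 2 inputs + bias, 2 hidden units + bias, 2 outputs.
X = [1 0.5 -1.2; 1 -0.3 0.8; 1 1.5 0.1; 1 -0.7 -0.4];   % m x 3, bias column first
Y = [1 0; 0 1; 1 0; 0 1];                                % m x 2
Theta1 = [0.1 0.3 -0.2; -0.4 0.2 0.5];                   % 2 x 3
Theta2 = [0.2 -0.1 0.4; -0.3 0.6 0.1];                   % 2 x 3
m = size(X, 1);
sigmoid = @(z) 1 ./ (1 + exp(-z));

% Forward pass for all examples at once.
H = [ones(m, 1), sigmoid(X * Theta1')];                  % m x 3, bias column first
O = sigmoid(H * Theta2');                                % m x 2

% Vectorized gradients: each matrix product sums over the m examples.
D2 = O - Y;                                              % m x 2
Theta2_grad_vec = (D2' * H) / m;                         % 2 x 3
D1 = (D2 * Theta2) .* (H .* (1 - H));                    % m x 3
D1 = D1(:, 2:end);                                       % drop the bias unit H0
Theta1_grad_vec = (D1' * X) / m;                         % 2 x 3

% Loop version from the question, for comparison.
Theta1_grad = zeros(size(Theta1));
Theta2_grad = zeros(size(Theta2));
for i = 1:m
    xi = X(i, :);  yi = Y(i, :);  hi = H(i, :);  oi = O(i, :);
    delta_theta2 = oi - yi;
    Theta2_grad = Theta2_grad + bsxfun(@times, hi, transpose(delta_theta2));
    dgz = hi .* (1 - hi);
    delta_theta1 = dgz .* sum(bsxfun(@times, Theta2, transpose(delta_theta2)));
    delta_theta1 = delta_theta1(2:end);
    Theta1_grad = Theta1_grad + bsxfun(@times, xi, transpose(delta_theta1));
end
Theta1_grad = Theta1_grad / m;
Theta2_grad = Theta2_grad / m;

max_diff = max([abs(Theta1_grad(:) - Theta1_grad_vec(:)); ...
                abs(Theta2_grad(:) - Theta2_grad_vec(:))]);
fprintf('largest difference between loop and vectorized gradients: %g\n', max_diff);
```

The product `D2' * H` performs the same sum over examples that the loop builds up with the plus sign, so no explicit accumulation is needed.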
