代码之家 › 专栏 › 技术社区 › warwcat

在具有一个特征的线性回归中,梯度下降系数通过每次迭代而增加

machine-learning numpy pandas python-3.x

warwcat · 技术社区 · 7 年前

您好,我正在学习一些机器学习算法,为了理解,我试图实现一种线性回归算法,其中一个特征是使用梯度下降法的残差平方和作为成本函数,如下所示:

我的伪代码:

 while not converge
     w <- w - step*gradient

python代码

import math
import numpy as num

def get_regression_predictions(input_feature, intercept, slope):
    predicted_output = [intercept + xi*slope for xi in input_feature]
    return(predicted_output)

def rss(input_feature, output, intercept,slope):
    return sum( [ ( output.iloc[i] - (intercept + slope*input_feature.iloc[i]) )**2 for i in range(len(output))])

def train(input_feature,output,intercept,slope):


    file = open("train.csv","w")
    file.write("ID,intercept,slope,RSS\n")
    i =0

    while True:

        print("RSS:",rss(input_feature, output, intercept,slope))
 file.write(str(i)+","+str(intercept)+","+str(slope)+","+str(rss(input_feature, output, intercept,slope))+"\n")
        i+=1

        gradient = [derivative(input_feature, output, intercept,slope,n) for n in range(0,2) ]

        step = 0.05
        intercept -= step*gradient[0]
        slope-= step*gradient[1]
    return intercept,slope 


 def derivative(input_feature, output, intercept,slope,n):
     if n==0:
         return sum( [ -2*(output.iloc[i] - (intercept + slope*input_feature.iloc[i])) for i in range(0,len(output))] ) 
     return sum( [  -2*(output.iloc[i] - (intercept + slope*input_feature.iloc[i]))*input_feature.iloc[i]  for i in range(0,len(output))] )

使用主程序:

import Linear as lin
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split




df = pd.read_csv("test2.csv")


train = df

lin.train(train["X"],train["Y"], 0, 0)

测试2。csv:

X,Y
0,1
1,3
2,7
3,13
4,21

ID,intercept,slope,RSS
0,0,0,669
1,4.5,14.0,3585.25
2,-7.25,-18.5,19714.3125
3,19.375,58.25,108855.953125

从数学上讲,我认为这没有任何意义,我多次查看自己的代码,我认为它是正确的,我在做其他错误的事情?

1 回复 | 直到 7 年前

alkasm Anuj Gautam 7 年前

如果你的成本没有下降,这通常是一个迹象,你的梯度下降方法,这意味着过大的步长。

推荐文章

Mainland · Python数据帧规范化值错误:列的长度必须与键相同

1 年前

user026 · 如何根据特定窗口的平均值(行数)创建新列?

1 年前

rpn · 如何在列[1]中连续第二次出现“0”时返回列[0]的值

1 年前

asmgx · 为什么合并数据帧不能按照python中的预期方式工作

1 年前

Gtoth · 如何分割Pandas DataFrame中包含多个日期的两个时间戳之间的差异

1 年前

Domarius · 使用loc为多行设置多列值

1 年前

Swastik Bhattacharyya · 如何在同一类别类型的多列上运行get_dummies()函数?

1 年前

DrZoidberg09 · 如何在字典列表中创建一个新关键字,该关键字是另一个关键字的总和?

1 年前

armstrong3701 · 如何有效地处理熊猫数据框中缺失的数据并计算条件统计?

1 年前

msts1906 · 大熊猫向乳胶的适当多品种出口

1 年前