Implementing Gradient Descent and Logistic Regression in Python (Full Source Code: Predicting Whether a Student Is Admitted)

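We will build a logistic regression model to predict whether a student is admitted to a university. Suppose you are the administrator of a university department and want to estimate each applicant's chance of admission from the results of two exams. You have historical data on previous applicants that can serve as the training set for logistic regression. For each training example, you have the applicant's scores on the two exams and the admission decision. To do this, we will build a classification model that estimates the probability of admission from the exam scores.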

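Importing the libraries

# the three core libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
# Jupyter/IPython magic; remove this line when running as a plain script
%matplotlib inline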

Loading the data and showing the first rows

import os
path = 'data' + os.sep + 'LogiReg_data.txt'
pdData = pd.read_csv(path, header=None, names=['Exam 1', 'Exam 2', 'Admitted'])
pdData.head()
pdData.shape
# check the dimensions of the data: 100 x 3

Displaying the data as a scatter plot

positive = pdData[pdData['Admitted'] == 1] # the rows where Admitted == 1, i.e. the set of *positive* examples
negative = pdData[pdData['Admitted'] == 0] # the rows where Admitted == 0, i.e. the set of *negative* examples

fig, ax = plt.subplots(figsize=(10,5)) # set the figure size
ax.scatter(positive['Exam 1'], positive['Exam 2'], s=30, c='b', marker='o', label='Admitted')
ax.scatter(negative['Exam 1'], negative['Exam 2'], s=30, c='r', marker='x', label='Not Admitted')
ax.legend()
ax.set_xlabel('Exam 1 Score')
ax.set_ylabel('Exam 2 Score')

The sigmoid function

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

nums = np.arange(-10, 10, step=1) # creates a vector of the 20 integers from -10 to 9
fig, ax = plt.subplots(figsize=(12,4))
ax.plot(nums, sigmoid(nums), 'r')
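
The plot above can be backed up with a few numeric checks. The following is a minimal sketch, not part of the original article, that evaluates the same `sigmoid` at a handful of points and confirms two properties the model relies on: the output at 0 is exactly 0.5, and the curve is symmetric in the sense that sigmoid(-z) = 1 - sigmoid(z). The names `values`, `midpoint` and `symmetric` are introduced here for illustration only.

```python
import numpy as np

def sigmoid(z):
    """Logistic function: maps any real number into the interval (0, 1)."""
    return 1 / (1 + np.exp(-z))

# Evaluate at a strongly negative input, at zero, and at a strongly positive input
z = np.array([-10.0, 0.0, 10.0])
values = sigmoid(z)

# sigmoid(0) is the decision midpoint
midpoint = sigmoid(0.0)

# Symmetry: sigmoid(-z) equals 1 - sigmoid(z) for every z
symmetric = np.allclose(sigmoid(-z), 1 - sigmoid(z))

print(values, midpoint, symmetric)
```

Because the output always lies strictly between 0 and 1, it can be read as a probability of admission.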

The model function

def model(X, theta):
    """ Returns our model result
    :param X: examples to classify, n x p
    :param theta: parameters, 1 x p
    :return: the sigmoid evaluated for each example in X given parameters theta, as an n x 1 vector
    """
    return sigmoid(np.dot(X, theta.T))  # matrix multiplication
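
The shape conventions in the docstring are easy to get wrong, so here is a small sketch that checks them. The four rows in `X_demo` are made-up values, not taken from `LogiReg_data.txt`; each row is a bias term followed by two exam scores.

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def model(X, theta):
    # X is n x p, theta is 1 x p, so X . theta^T is n x 1
    return sigmoid(np.dot(X, theta.T))

# Hypothetical toy input: 4 examples, each a bias column plus 2 scores
X_demo = np.array([[1.0, 30.0, 40.0],
                   [1.0, 60.0, 85.0],
                   [1.0, 75.0, 50.0],
                   [1.0, 90.0, 95.0]])
theta_demo = np.zeros([1, 3])

probs = model(X_demo, theta_demo)
print(probs.shape)   # (4, 1)
print(probs.ravel())
```

With all parameters at zero the linear term is 0 for every example, so every predicted probability is 0.5.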

pdData.insert(0, 'Ones', 1) # add a bias column of ones; raises ValueError if this cell is executed a second time, because 'Ones' already exists


# set X (training data) and y (target variable)
orig_data = pdData.to_numpy() # convert the DataFrame to a NumPy array (as_matrix() was removed in pandas 1.0)
cols = orig_data.shape[1]
X = orig_data[:,0:cols-1]
y = orig_data[:,cols-1:cols]

# initialize the parameter array theta
#X = np.matrix(X.values)
#y = np.matrix(data.iloc[:,3:4].values) #np.array(y.values)
theta = np.zeros([1, 3]) # placeholder of shape 1 x 3; write a little, check a little
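
The data preparation step can be tried without the data file. The sketch below uses a hypothetical three-row DataFrame as a stand-in for `LogiReg_data.txt` (the numbers are invented), and guards the `insert` call so the cell can be re-run safely.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the real data file; values are made up
pdData = pd.DataFrame({'Exam 1': [34.6, 60.2, 79.0],
                       'Exam 2': [78.0, 86.3, 75.3],
                       'Admitted': [0, 1, 1]})

# Guard: DataFrame.insert raises ValueError if the column already exists
if 'Ones' not in pdData.columns:
    pdData.insert(0, 'Ones', 1)

orig_data = pdData.to_numpy()
cols = orig_data.shape[1]
X = orig_data[:, 0:cols-1]     # bias column plus the two exam scores
y = orig_data[:, cols-1:cols]  # labels, kept as an n x 1 column
theta = np.zeros([1, 3])

print(X.shape, y.shape, theta.shape)
```

Slicing with `cols-1:cols` rather than indexing with `cols-1` keeps `y` two-dimensional, which is what the later `model(X, theta) - y` subtraction expects.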

The loss function

def cost(X, y, theta):  # data, labels, parameters
    left = np.multiply(-y, np.log(model(X, theta)))
    right = np.multiply(1 - y, np.log(1 - model(X, theta)))
    return np.sum(left - right) / (len(X))
cost(X, y, theta)
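
Read from the code, `cost` computes the mean negative log-likelihood, J(theta) = -(1/n) * sum over i of [ y_i * log(h(x_i)) + (1 - y_i) * log(1 - h(x_i)) ], where h is the `model` function. A quick sanity check, sketched here on invented toy data, is that with theta set to zero every prediction is 0.5, so the cost must equal ln 2 (about 0.693) regardless of the labels.

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def model(X, theta):
    return sigmoid(np.dot(X, theta.T))

def cost(X, y, theta):
    left = np.multiply(-y, np.log(model(X, theta)))
    right = np.multiply(1 - y, np.log(1 - model(X, theta)))
    return np.sum(left - right) / len(X)

# Hypothetical toy data: bias column plus two scores, labels as an n x 1 column
X_demo = np.array([[1.0, 30.0, 40.0],
                   [1.0, 60.0, 85.0],
                   [1.0, 75.0, 50.0],
                   [1.0, 90.0, 95.0]])
y_demo = np.array([[0.0], [1.0], [0.0], [1.0]])

initial_cost = cost(X_demo, y_demo, np.zeros([1, 3]))
print(initial_cost, np.log(2))
```

If the value printed for the real data at theta = 0 differs from ln 2, the shapes of `X`, `y` or `theta` are the first thing to check.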

Computing the gradient

def gradient(X, y, theta):
    grad = np.zeros(theta.shape)
    error = (model(X, theta)- y).ravel()
    for j in range(len(theta.ravel())): # for each parameter
        term = np.multiply(error, X[:,j])
        grad[0, j] = np.sum(term) / len(X)
    
    return grad
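
One way to gain confidence in a hand-written gradient is to compare it with a finite-difference approximation of the cost. The sketch below does that on random synthetic data; `numerical_gradient` and the demo arrays are helpers introduced here, not part of the original article. It also shows an equivalent vectorized form of the loop.

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def model(X, theta):
    return sigmoid(np.dot(X, theta.T))

def cost(X, y, theta):
    h = model(X, theta)
    return np.sum(-y * np.log(h) - (1 - y) * np.log(1 - h)) / len(X)

def gradient(X, y, theta):
    grad = np.zeros(theta.shape)
    error = (model(X, theta) - y).ravel()
    for j in range(len(theta.ravel())):  # for each parameter
        grad[0, j] = np.sum(error * X[:, j]) / len(X)
    return grad

def numerical_gradient(X, y, theta, eps=1e-6):
    """Central finite differences of cost with respect to each parameter."""
    approx = np.zeros(theta.shape)
    for j in range(theta.shape[1]):
        step = np.zeros(theta.shape)
        step[0, j] = eps
        approx[0, j] = (cost(X, y, theta + step) - cost(X, y, theta - step)) / (2 * eps)
    return approx

# Synthetic data: bias column plus two standard-normal features, random labels
rng = np.random.default_rng(0)
X_demo = np.hstack([np.ones((20, 1)), rng.normal(size=(20, 2))])
y_demo = (rng.random((20, 1)) > 0.5).astype(float)
theta_demo = np.array([[0.1, -0.2, 0.3]])

analytic = gradient(X_demo, y_demo, theta_demo)
numeric = numerical_gradient(X_demo, y_demo, theta_demo)
max_diff = np.max(np.abs(analytic - numeric))

# Same gradient without the Python loop: (h - y)^T X / n, shape 1 x p
vectorized = np.dot((model(X_demo, theta_demo) - y_demo).T, X_demo) / len(X_demo)
print(max_diff, np.allclose(analytic, vectorized))
```

The loop version is kept in the article for readability; the vectorized form gives the same numbers and is usually faster on larger inputs.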

Comparing three gradient descent methods

STOP_ITER = 0 # stop after a fixed number of iterations
STOP_COST = 1 # stop when the change in the objective falls below a threshold
STOP_GRAD = 2 # stop when the gradient norm falls below a threshold

def stopCriterion(type, value, threshold):
    # three different stopping strategies
    if type == STOP_ITER:        return value > threshold
    elif type == STOP_COST:      return abs(value[-1]-value[-2]) < threshold
    elif type == STOP_GRAD:      return np.linalg.norm(value) < threshold
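
Each stopping strategy expects a different kind of `value`: an iteration counter, the list of costs so far, or the latest gradient. The following sketch calls `stopCriterion` once per strategy with made-up inputs to show what each one tests.

```python
import numpy as np

STOP_ITER, STOP_COST, STOP_GRAD = 0, 1, 2

def stopCriterion(type, value, threshold):
    if type == STOP_ITER:
        return value > threshold                        # value: iteration count
    elif type == STOP_COST:
        return abs(value[-1] - value[-2]) < threshold   # value: list of costs
    elif type == STOP_GRAD:
        return np.linalg.norm(value) < threshold        # value: gradient array

# Iteration 5001 exceeds a budget of 5000 iterations
stop_by_iter = stopCriterion(STOP_ITER, 5001, 5000)

# The last two costs differ by about 1e-6, below the 1e-4 threshold
stop_by_cost = stopCriterion(STOP_COST, [0.70, 0.65, 0.649999], 1e-4)

# The gradient norm is 0.5, still above the 0.05 threshold
stop_by_grad = stopCriterion(STOP_GRAD, np.array([[0.3, 0.4, 0.0]]), 0.05)

print(stop_by_iter, stop_by_cost, stop_by_grad)
```

Note that the cost-based test only compares the last two entries, so with small batches a single noisy step can trigger or delay the stop.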
import numpy.random

# shuffle the data to improve generalization
def shuffleData(data):
    np.random.shuffle(data) # shuffles the rows of data in place
    cols = data.shape[1]
    X = data[:, 0:cols-1] # features
    y = data[:, cols-1:]  # labels
    return X, y
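
Two details of `shuffleData` are worth seeing in action: whole rows are shuffled, so each label stays attached to its features, and the returned `X` and `y` are views into `data` rather than copies. The sketch below checks both on a small made-up array in which the label is a known function of the score.

```python
import numpy as np

def shuffleData(data):
    np.random.shuffle(data)      # reorders the rows of data in place
    cols = data.shape[1]
    X = data[:, 0:cols-1]
    y = data[:, cols-1:]
    return X, y

np.random.seed(0)

# Hypothetical rows of the form [bias, score, label], with label = 1 when score > 5
data_demo = np.array([[1.0, float(s), float(s > 5)] for s in range(10)])

X_demo, y_demo = shuffleData(data_demo)

# After shuffling, every label still matches the score in the same row
rows_intact = np.array_equal(y_demo.ravel(), (X_demo[:, 1] > 5).astype(float))

# Basic slicing returns views, so X_demo shares memory with data_demo
shares_memory = np.shares_memory(X_demo, data_demo)

print(rows_intact, shares_memory)
```

Because of the shared memory, calling `shuffleData(data)` again also reorders any `X` and `y` obtained from an earlier call; pass `data.copy()` if the caller's array must stay untouched.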

import time

def descent(data, theta, batchSize, stopType, thresh, alpha):
    # solve for theta by gradient descent

    init_time = time.time()
    i = 0 # iteration count
    k = 0 # start index of the current batch
    n = data.shape[0] # number of training examples
    X, y = shuffleData(data)
    grad = np.zeros(theta.shape) # most recent gradient
    costs = [cost(X, y, theta)] # loss history


    while True:
        grad = gradient(X[k:k+batchSize], y[k:k+batchSize], theta)
        k += batchSize # advance by one batch
        if k >= n:
            k = 0
            X, y = shuffleData(data) # reshuffle after each full pass
        theta = theta - alpha*grad # parameter update
        costs.append(cost(X, y, theta)) # record the new loss
        i += 1

        if stopType == STOP_ITER:       value = i
        elif stopType == STOP_COST:     value = costs
        elif stopType == STOP_GRAD:     value = grad
        if stopCriterion(stopType, value, thresh): break

    return theta, i-1, costs, grad, time.time() - init_time

def runExpe(data, theta, batchSize, stopType, thresh, alpha):
    #import pdb; pdb.set_trace();
    theta, iter, costs, grad, dur = descent(data, theta, batchSize, stopType, thresh, alpha)
    n = data.shape[0] # number of training examples
    name = "Original" if (data[:,1]>2).sum() > 1 else "Scaled"
    name += " data - learning rate: {} - ".format(alpha)
    if batchSize==n: strDescType = "Gradient"
    elif batchSize==1:  strDescType = "Stochastic"
    else: strDescType = "Mini-batch ({})".format(batchSize)
    name += strDescType + " descent - Stop: "
    if stopType == STOP_ITER: strStop = "{} iterations".format(thresh)
    elif stopType == STOP_COST: strStop = "costs change < {}".format(thresh)
    else: strStop = "gradient norm < {}".format(thresh)
    name += strStop
    print ("***{}\nTheta: {} - Iter: {} - Last cost: {:03.2f} - Duration: {:03.2f}s".format(
        name, theta, iter, costs[-1], dur))
    fig, ax = plt.subplots(figsize=(12,4))
    ax.plot(np.arange(len(costs)), costs, 'r')
    ax.set_xlabel('Iterations')
    ax.set_ylabel('Cost')
    ax.set_title(name.upper() + ' - Error vs. Iteration')
    return theta
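
The article stops before calling `runExpe`, so the following is a condensed sketch of how the pieces fit together, under several assumptions: the data is synthetic (two standard-normal features with noisy labels) rather than the contents of `LogiReg_data.txt`, plotting is omitted, the gradient is written in vectorized form, and `descent_demo` is a shortened stand-in for `descent` introduced here for illustration. The learning rate of 0.1 and the 500-iteration budget are arbitrary choices, not values from the original.

```python
import numpy as np

STOP_ITER, STOP_COST, STOP_GRAD = 0, 1, 2

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def model(X, theta):
    return sigmoid(np.dot(X, theta.T))

def cost(X, y, theta):
    h = model(X, theta)
    return np.sum(-y * np.log(h) - (1 - y) * np.log(1 - h)) / len(X)

def gradient(X, y, theta):
    # Vectorized form of the per-parameter loop, shape 1 x p
    return np.dot((model(X, theta) - y).T, X) / len(X)

def descent_demo(data, theta, batchSize, stopType, thresh, alpha):
    n = data.shape[0]
    i, k = 0, 0
    np.random.shuffle(data)
    X, y = data[:, :-1], data[:, -1:]   # views: they follow later shuffles
    costs = [cost(X, y, theta)]
    while True:
        grad = gradient(X[k:k + batchSize], y[k:k + batchSize], theta)
        k += batchSize
        if k >= n:                      # one full pass done: reshuffle
            k = 0
            np.random.shuffle(data)
        theta = theta - alpha * grad
        costs.append(cost(X, y, theta))
        i += 1
        if stopType == STOP_ITER:
            done = i > thresh
        elif stopType == STOP_COST:
            done = abs(costs[-1] - costs[-2]) < thresh
        else:
            done = np.linalg.norm(grad) < thresh
        if done:
            break
    return theta, costs

# Synthetic stand-in data: columns are [bias, feature 1, feature 2, label]
np.random.seed(0)
rng = np.random.default_rng(0)
scores = rng.normal(size=(100, 2))
labels = (scores.sum(axis=1) + rng.normal(scale=0.5, size=100) > 0).astype(float)
data = np.hstack([np.ones((100, 1)), scores, labels[:, None]])

# Full-batch gradient descent: batchSize equals the number of examples
theta, costs = descent_demo(data, np.zeros([1, 3]), 100, STOP_ITER, 500, 0.1)
print(theta, costs[0], costs[-1])
```

Setting `batchSize` to 1 gives stochastic gradient descent and any value in between gives mini-batch descent, which is how the three methods in this section differ; the stopping rule is chosen independently through `stopType` and `thresh`.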
