更多【GitHub热榜第四！这套Python机器学习课，免费获取还易吸收 | 资源】教程文章相关的互联网学习教程文章

【GitHub热榜第四！这套Python机器学习课，免费获取还易吸收 | 资源】教程文章相关的互联网学习教程文章

机器学习---用python实现感知机算法和口袋算法（Machine Learning PLA Pocket Algorithm Application）【代码】【图】

之前在《机器学习---感知机（Machine Learning Perceptron）》一文中介绍了感知机算法的理论知识，现在让我们来实践一下。有两个数据文件：data1和data2，分别用于PLA和Pocket Algorithm。可在以下地址下载：先回顾一下感知机算法： 1，初始化w 2，找出一个分类错误点 3，修正错误，假设迭代次数为t次（t=1,2,...），那么修正公式为： 4，直至没有分类错误点，返回最终的w 接下来让我们安照算法步骤，一步一步进行。首先...

吴裕雄 python 机器学习——支持向量机非线性回归SVR模型【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom sklearn import datasets, linear_model,svm from sklearn.model_selection import train_test_splitdef load_data_regression():加载用于回归问题的数据集diabetes = datasets.load_diabetes() #使用 scikit-learn 自带的一个糖尿病病人的数据集# 拆分成训练集和测试集，测试集大小为原始数据集大小的 1/4return train_test_split(diabetes.data,diabetes.target,test_si...

吴裕雄 python 机器学习——支持向量机线性分类LinearSVC模型【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom sklearn import datasets, linear_model,svm from sklearn.model_selection import train_test_splitdef load_data_classfication():加载用于分类问题的数据集# 使用 scikit-learn 自带的 iris 数据集iris=datasets.load_iris() X_train=iris.datay_train=iris.target# 分层采样拆分成训练集和测试集，测试集大小为原始数据集大小的 1/4return train_test_split(X_train, ...

用Python机器学习搞定验证码

Roy 马哥Linux运维 4月9日写爬虫有一个绕不过去的问题就是验证码，现在验证码分类大概有4种：图像类滑动类点击类语音类今天先来看看图像类，这类验证码大多是数字、字母的组合，国内也有使用汉字的。在这个基础上增加噪点、干扰线、变形、重叠、不同字体颜色等方法来增加识别难度。相应的，验证码识别大体可以分为下面几个步骤：灰度处理增加对比度(可选) 二值化降噪倾斜校正分割字符建立训练库识别由于是实验性质的...

吴裕雄 python 机器学习——KNN分类KNeighborsClassifier模型【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom sklearn import neighbors, datasets from sklearn.model_selection import train_test_splitdef load_classification_data():# 使用 scikit-learn 自带的手写识别数据集 Digit Datasetdigits=datasets.load_digits() X_train=digits.datay_train=digits.target# 进行分层采样拆分，测试集大小占 1/4return train_test_split(X_train, y_train,test_size=0.25,random_stat...

吴裕雄 python 机器学习——高斯贝叶斯分类器GaussianNB【代码】【图】

import matplotlib.pyplot as pltfrom sklearn import datasets,naive_bayes from sklearn.model_selection import train_test_split# 加载 scikit-learn 自带的 digits 数据集 def load_data():加载用于分类问题的数据集。这里使用 scikit-learn 自带的 digits 数据集digits=datasets.load_digits()return train_test_split(digits.data,digits.target,test_size=0.25,random_state=0,stratify=digits.target)def show_digits():...

python学习记录-机器学习

首先安装了anaconda3软件，安装的是最新版，安装时勾选了写入环境变量，支持的是python3.7.3版本。然后设置了清华大学的镜像，主要是用管理员身份运行 anaconda prompt命令行，然后执行conda config命令，然后执行conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/?? 添加清华大学镜像下载地址 conda config --show channels 该命令行可以得到目前可以使用的下载地址，包括刚刚我们添加的镜...

吴裕雄 python 机器学习——线性判断分析LinearDiscriminantAnalysis【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom matplotlib import cm from mpl_toolkits.mplot3d import Axes3D from sklearn.model_selection import train_test_split from sklearn import datasets, linear_model,discriminant_analysisdef load_data():# 使用 scikit-learn 自带的 iris 数据集iris=datasets.load_iris()X_train=iris.datay_train=iris.targetreturn train_test_split(X_train, y_train,test_size=0...

吴裕雄 python 机器学习——岭回归【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom sklearn import datasets, linear_model from sklearn.model_selection import train_test_splitdef load_data():diabetes = datasets.load_diabetes()return train_test_split(diabetes.data,diabetes.target,test_size=0.25,random_state=0)#岭回归 def test_Ridge(*data):X_train,X_test,y_train,y_test=dataregr = linear_model.Ridge()regr.fit(X_train, y_train)pri...

吴裕雄 python 机器学习——Lasso回归【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom sklearn import datasets, linear_model from sklearn.model_selection import train_test_splitdef load_data():diabetes = datasets.load_diabetes()return train_test_split(diabetes.data,diabetes.target,test_size=0.25,random_state=0)#Lasso回归 def test_Lasso(*data):X_train,X_test,y_train,y_test=dataregr = linear_model.Lasso()regr.fit(X_train, y_train)...

萌新向Python数据分析及数据挖掘第三章机器学习常用算法第三节梯度下降法（上）理解篇【图】

理解特点作用最小化一个损失函数最大化一个效用函数：梯度上升法随机梯度下降法以单个数据作为梯度下降的依据优点批量梯度下降法以整体数据作为每次梯度下降的方向的根据小批量梯度下降法...

萌新向Python数据分析及数据挖掘第三章机器学习常用算法第三节梯度下降法（下）实操篇【代码】

In?[1]:?? ? ? ?from sklearn import datasets??In?[2]:?? ? ? ?boston = datasets.load_boston() X = boston.data y = boston.target #去除不真实的数据 X = X[y < 50] y = y[y < 50] ???In?[3]:?? ? ? ?from sklearn.model_selection import train_test_split #载入数据切分工具??In?[5]:?? ? ? ?X_train, X_test, y_train, y_test = train_test_split(X,y,test_size = 0.2,random_state=666) #切分数据??In?[6]:?? ? ? ?from...

萌新向Python数据分析及数据挖掘第三章机器学习常用算法第二节线性回归算法（上）理解篇【图】

理解以a b为变量，预测值与真值的差的平方和为结果的函数参数学习的基本方法：找到最优参数使得预测与真实值差距最小假设可以找到一条直线 y = ax+b 使得预测值与真值的差的平方和最小故事假设你面前有一堆男人这些男人的基本信息全部掌握，包括他们的年收入简单线性回归简单线性回归的思想就是：假设你相信，这些男人的身高越高，年收入越高，然后找到一个一元线性方程，让这个方程猜得最准，然后把这个方程记...

萌新向Python数据分析及数据挖掘第三章机器学习常用算法第一节 KNN算法（下）实操篇【代码】

import numpy as np from sklearn import datasets# 载入数据包??In?[2]:?? ? ? ?digits = datasets.load_digits()#读取数据 X = digits.data#定义X y = digits.target#定义y??In?[3]:?? ? ? ?from sklearn.model_selection import train_test_split #载入数据切分工具??In?[4]:?? ? ? ?X_train, X_test, y_train, y_test = train_test_split(X,y,test_size = 0.2)#数据切分???Signature: train_test_split(arrays, *options) Docs...

Python机器学习笔记异常点检测算法——Isolation Forest【图】

Isolation，意为孤立/隔离，是名词，其动词为isolate，forest是森林，合起来就是“孤立森林”了，也有叫“独异森林”，好像并没有统一的中文叫法。可能大家都习惯用其英文的名字isolation forest，简称iForest 。iForest适用于连续数据（Continuous numerical data）的异常检测，将异常定义为“容易被孤立的离群点（more likely to be separated）”——可以理解为分布稀疏且离密度高的群体较远的点。用统计学来解释，在数据空间里...

上一页
1
...
15
16
17
18
19
...
21
下一页
共 21 页
共 304 条