首页 / PYTHON / Python Web Scraping表返回None

Python Web Scraping表返回None

内容导读

互联网集市收集整理的这篇技术教程文章主要介绍了Python Web Scraping表返回None，小编现在分享给大家，供广大互联网技能从业者学习和参考。文章包含1282字，纯文字阅读大概需要2分钟。

内容图文

我正在尝试从www.intellicast.com刮擦桌子的温度元件

soup =  BeautifulSoup(urllib2.urlopen('http://www.intellicast.com/Local/History.aspx?location=USTX0057').read())
for row in soup('table',{'id':'dailyClimate'})[0].tbody('tr'):
  tds=row
  print tds

结果：
TypeErrorL’NoneType’对象不可调用

当查看页面源代码时,我可以看到

<table id = "dailyClimate" class="Container">
  <tbody>
    <tr class="TitlesAvgRecord">
       <td..
    <td>...</td>

因此,我知道有一个tbody和一个tr元素.

如果将.tbody(‘td’)的.tbody(‘tr’)更改,我仍然会收到错误,因此我假设我假设错误是在调用tbody的某个地方.

解决方法:

您的浏览器会插入< tbody>元素,但实际来源没有该元素：

<table id="dailyClimate" class="Container">
  <tr class="TitlesAvgRecord">
    <td style="padding-left:5px;">Date</td>
    <td>Average<br />Low</td>
    <td>Average<br />High</td>
    <td>Record<br />Low</td>
    <td>Record<br />High</td>
    <td>Average<br />Precipitation</td>
    <td>Average<br />Snow</td>
  </tr>

<!-- etc. -->

查看Why do browsers insert tbody element into table elements?

您可以改用html5lib解析器(使用BeautifulSoup(source,’html5lib’)),它也可以插入元素.但是,您无需搜索它,只需直接转到< tr>行数：

for row in soup.find('table', id='dailyClimate').find_all('tr'):

或使用CSS选择器：

for row in soup.select('table#dailyClimate tr'):

通常,只有在可能有多个元素或者要排除的thead或tfooter元素时,才选择tbody元素.

内容总结

以上是互联网集市为您收集整理的Python Web Scraping表返回None全部内容，希望文章能够帮你解决Python Web Scraping表返回None所遇到的程序开发问题。如果觉得互联网集市技术教程内容还不错，欢迎将互联网集市网站推荐给程序员好友。

内容备注

版权声明：本文内容由互联网用户自发贡献，该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容，请发送邮件至 gblab@vip.qq.com 举报，一经查实，本站将立刻删除。

内容手机端

扫描二维码推送至手机访问。

本文链接：https://qyyshop.com/info/659504.html

来源：【匿名】

【上一篇】Python win32com.client和Outlook 【下一篇】浅谈PHP运行Python脚本的方法

更多 ►

【Python Web Scraping表返回None】教程文章相关的互联网学习教程文章

Python for Data Science - Web scraping【代码】

Chapter 6 - Data Sourcing via Web Segment 4 - Web scraping from bs4 import BeautifulSoup import urllib.request from IPython.display import HTML import rer = urllib.request.urlopen('https://analytics.usa.gov/').read() soup = BeautifulSoup(r, "lxml") type(soup)bs4.BeautifulSoupprint(soup.prettify()[:100])<!DOCTYPE html> <html lang="en"><head><!--for...

Python Web Scraping表返回None【代码】

我正在尝试从www.intellicast.com刮擦桌子的温度元件soup = BeautifulSoup(urllib2.urlopen('http://www.intellicast.com/Local/History.aspx?location=USTX0057').read()) for row in soup('table',{'id':'dailyClimate'})[0].tbody('tr'):tds=rowprint tds结果：TypeErrorL’NoneType’对象不可调用当查看页面源代码时,我可以看到<table id = "dailyClimate" class="Container"><tbody><tr class="TitlesAvgRecord"><td..<td>....

Web Scraping using Python Scrapy_BS4 - using BeautifulSoup and Python【代码】【图】

Use BeautifulSoup and Python to scrap a website Lib:urllib Parsing HTML DataWeb scraping scriptfrom urllib.request import urlopen as uReq from bs4 import BeautifulSoup as soupquotes_page = "https://bluelimelearning.github.io/my-fav-quotes/" uClient = uReq(quotes_page) page_html = uClient.read() uClient.close() page_soup = soup(page_html, "html.parser") quotes = page_soup.findAll("div", {"class":"q...

Python – Web Scraping HTML表格和打印到CSV【代码】

我几乎是Python的新手,但我正在寻找一个网络编写工具,它将在线从HTML表中删除数据并以相同的格式将其打印成CSV. 这是HTML表的一个示例(它是巨大的,所以我将只提供几行).<div class="col-xs-12 tab-content"><div id="historical-data" class="tab-pane active"><div class="tab-header"><h2 class="pull-left bottom-margin-2x">Historical data for Bitcoin</h2><div class="clear"></div><div class="row"><div class="col-md-12...

python – Scraping：SSL：http：//en.wikipedia.org的CERTIFICATE_VERIFY_FAILED错误【代码】

我正在练习’Web Scraping with Python‘的代码,我一直有这个证书问题：from urllib.request import urlopen from bs4 import BeautifulSoup import repages = set() def getLinks(pageUrl):global pageshtml = urlopen("http://en.wikipedia.org"+pageUrl)bsObj = BeautifulSoup(html)for link in bsObj.findAll("a", href=re.compile("^(/wiki/)")):if 'href' in link.attrs:if link.attrs['href'] not in pages:#We have enco...

使用Selenium和Beautiful Soup的Python Scraping JavaScript【代码】

我正在尝试使用BS和Selenium抓取一个JavaScript启用页面.到目前为止,我有以下代码.它仍然不会以某种方式检测JavaScript(并返回一个空值).在这种情况下,我试图在底部刮掉Facebook的评论. (Inspect元素将类显示为postText)谢谢您的帮助！from selenium import webdriver from selenium.common.exceptions import NoSuchElementException from selenium.webdriver.common.keys import Keys import BeautifulSoupbrowser = webdr...

python – asyncio web scraping 101：使用aiohttp获取多个url【代码】

在之前的问题中,aiohttp的一位作者使用Python 3.5中的新async语法建议使用fetch multiple urls with aiohttp：import aiohttp import asyncioasync def fetch(session, url):with aiohttp.Timeout(10):async with session.get(url) as response:return await response.text()async def fetch_all(session, urls, loop):results = await asyncio.wait([loop.create_task(fetch(session, url))for url in urls])return resultsif __n...

python – 错误1(HY000)：无法创建/写入文件’./scraping/db.opt'(错误代码：2)【代码】

通过“使用Python进行Web Scraping”工作,我就是你使用MySQL的部分.在Google上找不到任何对此错误消息特别有帮助的内容 – 你们中的任何人都可以帮我解码吗？ (并希望弄清楚如何解决它？！)我在输入命令后收到错误：ALTER DATABASE scraping CHARACTER set = utf8mb4 COLLATE = utf8mb4_unicode_ci;输出：ERROR 1 (HY000): Can't create/write to file './scraping/db.opt' (Errcode: 2) mysql> 解决方法:您必须确保您的数据库名为...

Twitter Scraping重复执行代码(python)【代码】

这是一个Twitter抓取代码,用于提取包含着名关键字的推文. 我想每12小时重复下面的整个代码. (或12小时10分钟休息).你可以给我重复短语的建议吗？import tweepy import time import os import json import simplejsonsearch_term = 'word1' search_term2= 'word2' search_term3='word3'lat = "xxxx" lon = "xxxx" radius = "xxxx" location = "%s,%s,%s" % (lat, lon, radius)API_key = "xxxx" API_secret = "xxxx" Access_token = ...

python – speako的API？ Scraping Spokeo

是否有讲话的API？我想以json或xml格式获得结果,我尝试为它找到api但不能.有没有人尝试用或不用api刮取speako？我确信我们可以通过一般方式进行搜索,但是当搜索结果出现多个位置区域时,我不知道如何继续.谢谢解决方法:根据Spokeo的terms of use,明确禁止使用刮刀,任何“衍生作品”也是如此 – 即使所有这些作品都是来自其网站的框架内容. 如果您在公开的应用程序中发布它,请准备好为它做一些准备.

Scraping Website Using Python

5 library recommended for scraping website, https://elitedatascience.com/python-web-scraping-librariesThe Farm: Requests The Stew: Beautiful Soup 4 The Salad: lxml The Restaurant: Selenium The Chef: ScrapyResourcesRequests Quickstart Guide – Official documentation. Covers practical topics like passing parameters, handling responses, and configuring headers.Beautiful Soup Documentation – Includes...

scraping website using python

python特点：强大的第三方库。优缺点：较小的代价建造工程，也要求我们对库要有了解。基本特性： 1. 面向对象，动态编译，脚本语言 2. 不依赖平台 3. 对接了几乎所有系统操作的API python是用c实现的，依赖于扩展的，易于理解的可移植c库。和unix无缝对接，在非unix系统下也能相似运行。可选IDE：Komodo, VIM, EMACS, TEXTPAD, BBEDIT 学习目标： 1.variables, statements, exceptions, functions 2.类和子类的定义，包括...

PYTHON - 技术教程分类

Python3 教程 Python3 简介 Python3 环境搭建 Python3 基础语法 Python3 基本数据类型 Python3 解释器 Python3 注释 Python3 运算符 Python3 数字(Number) Python3 字符串 Python3 列表 Python3 元组 Python3 字典 Python3 集合 Python3 编程第一步 Python3 条件控制 Python3 循环语句 Python3 迭代器与生成器 Python3 函数 Python3 数据结构 Python3 模块 Python3 输入和输出 Python3 File Python3 OS Python3 错误和异常 Python3 面向对象 Python3 命名空间/作用域 Python3 标准库概览 Python3 实例 Python3 CGI编程 Python3 MySQL(PyMySQL) Python3 网络编程 Python3 SMTP发送邮件 Python3 多线程 Python3 日期和时间 Python3 内置函数 Python3 MongoDB Python3 urllib python 全部

PYTHON - 最热教程

python如何统计字符串中字母个数？使用Python进行微信公众号开发（三）回...Python+PyQT5的子线程更新UI界面的实例 python时间戳怎么获得？如何获得当前时...vscode调试python时提示无法将“conda”...python接口自动化全局变量access_token...python收取邮件(腾讯企业邮箱)python如何绘制降水图详解python并发获取snmp信息及性能测试...怎么卸载Python3.6？

首页 / PYTHON / Python Web Scraping表返回None

Python Web Scraping表返回None

内容导读

内容图文

内容总结

内容备注

内容手机端

【Python Web Scraping表返回None】教程文章相关的互联网学习教程文章

Python for Data Science - Web scraping【代码】

Python Web Scraping表返回None【代码】

Web Scraping using Python Scrapy_BS4 - using BeautifulSoup and Python【代码】【图】

Python – Web Scraping HTML表格和打印到CSV【代码】

python – Scraping：SSL：http：//en.wikipedia.org的CERTIFICATE_VERIFY_FAILED错误【代码】

使用Selenium和Beautiful Soup的Python Scraping JavaScript【代码】

python – asyncio web scraping 101：使用aiohttp获取多个url【代码】

python – 错误1(HY000)：无法创建/写入文件’./scraping/db.opt'(错误代码：2)【代码】

Twitter Scraping重复执行代码(python)【代码】

python – speako的API？ Scraping Spokeo

Scraping Website Using Python

scraping website using python

PYTHON - 相关标签

WEB - 相关标签

PYTHON - 技术教程分类

PYTHON - 最新教程

PYTHON - 最热教程