java 获取网页指定内容-2(实践+修改)
内容导读
互联网集市收集整理的这篇技术教程文章主要介绍了java 获取网页指定内容-2(实践+修改),小编现在分享给大家,供广大互联网技能从业者学习和参考。文章包含2325字,纯文字阅读大概需要4分钟。
内容图文
import java.io.BufferedReader; import java.io.InputStreamReader; import java.net.HttpURLConnection; import java.net.URL; import java.util.regex.Matcher; import java.util.regex.Pattern; import java.util.Arrays; public class Weather { String urlString; String array; StringBuffer sb=new StringBuffer(""); publicstaticvoid main(String[] args) throws Exception { Weather client = new Weather("http://www.weather.com.cn/weather/101181201.shtml"); client.run(); } public Weather(String urlString) { this.urlString = urlString; } publicvoid run() throws Exception { URL url = new URL(urlString); HttpURLConnection urlConnection = (HttpURLConnection) url.openConnection(); BufferedReader reader = new BufferedReader(new InputStreamReader(urlConnection .getInputStream(),"utf8")); String line; while ((line = reader.readLine()) != null){ Pattern p = Pattern.compile("<p class=\"wea\">(.+?)</p>"); Matcher m = p.matcher(line); while(m.find()) { array = m.group(1); sb.append(array+","); } } String arr = sb.toString(); String[] s = arr.split("\\,"); System.out.println(s[s.length - 7]); } }
utf8编码格式
import java.io.BufferedReader; import java.io.InputStreamReader; import java.net.HttpURLConnection; import java.net.URL; import java.util.regex.Matcher; import java.util.regex.Pattern; import java.util.Arrays; public class Weather { String urlString; String array; StringBuffer sb=new StringBuffer(""); publicstaticvoid main(String[] args) throws Exception { Weather client = new Weather("http://www.weather.com.cn/weather/101181201.shtml"); client.run(); } public Weather(String urlString) { this.urlString = urlString; } publicvoid run() throws Exception { URL url = new URL(urlString); HttpURLConnection urlConnection = (HttpURLConnection) url.openConnection(); BufferedReader reader = new BufferedReader(new InputStreamReader(urlConnection .getInputStream(),"utf8")); String line; while ((line = reader.readLine()) != null){ Pattern p = Pattern.compile("<p class=\"wea\">(.+?)</p>"); Matcher m = p.matcher(line); while(m.find()) { array = m.group(1); sb.append(array+",");//符合正则的数据追加到sb,并以逗号分割 } } String arr = sb.toString();//sb转为字符串 String[] s = arr.split("\\,");//字符串转为数组,以逗号为标记 System.out.println(s[s.length - 7]);//取数组中倒数第7个数 } }
原文:http://www.cnblogs.com/hellowzd/p/4991796.html
内容总结
以上是互联网集市为您收集整理的java 获取网页指定内容-2(实践+修改)全部内容,希望文章能够帮你解决java 获取网页指定内容-2(实践+修改)所遇到的程序开发问题。 如果觉得互联网集市技术教程内容还不错,欢迎将互联网集市网站推荐给程序员好友。
内容备注
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 gblab@vip.qq.com 举报,一经查实,本站将立刻删除。
内容手机端
扫描二维码推送至手机访问。