我试图在本地文件系统(独立模式)上测试我的计算机(MacOS 10.7)上的一个非常简单的hadoop map-reduce作业.该作业采用.csv文件(data-01)并计算某些字段的出现次数. 我下载了CDH4 hadoop,运行该作业,它似乎正常启动但是在处理完所有拆分后我得到以下错误:13/03/12 12:11:18 INFO mapred.MapTask: Processing split: file:/path/in/data-01:9999220736+33554432 13/03/12 12:11:18 INFO mapred.MapTask: Map output collector class =...
这是我的地图public static class MapClass extends Mapper<LongWritable, Text, Text, Text> {public void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException{String[] fields = value.toString().split(",", -20);String country = fields[4];String numClaims = fields[8];if (numClaims.length() > 0 && !numClaims.startsWith("\"")) {context.write(new Text(country), new Tex...
我终于能够在Hadoop上启动map-reduce工作(在单个debian机器上运行).但是,map reduce作业始终失败,并显示以下错误:hadoopmachine@debian:~$./hadoop-1.0.1/bin/hadoop jar hadooptest/main.jar nl.mydomain.hadoop.debian.test.Main /user/hadoopmachine/input /user/hadoopmachine/output Warning: $HADOOP_HOME is deprecated.12/04/03 07:29:35 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Ap...