Hadoop Example Test (Complete Version)

1. Create a test directory

    [root@localhost hadoop-1.1.1]# bin/hadoop dfs -mkdir /hadoop/input

2. Create a test file

    [root@localhost test]# vi test.txt
    hello hadoop
    hello World
    Hello Java
    Hey man
    i am a programmer

3. Put the test file into the test directory

    [root@localhost hadoop-1.1.1]# bin/hadoop dfs -put ./test/test.txt /hadoop/input

4. Run the wordcount program

    [root@localhost hadoop-1.1.1]# bin/hadoop jar hadoop-examples-1.1.1.jar wordcount /hadoop/input/* /hadoop/output

The /hadoop/output directory must not exist before the job runs; otherwise the job fails with:

    org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory /hadoop/output already exists

Hadoop jobs are resource-intensive computations, so by default their results cannot be overwritten.

If the job succeeds, it prints output like the following:

    [root@localhost hadoop-1.1.1]# bin/hadoop jar hadoop-examples-1.1.1.jar wordcount /hadoop/input/* /hadoop/output
    13/01/17 00:36:06 INFO input.FileInputFormat: Total input paths to process : 1
    13/01/17 00:36:06 INFO util.NativeCodeLoader: Loaded the native-hadoop library
    13/01/17 00:36:06 WARN snappy.LoadSnappy: Snappy native library not loaded
    13/01/17 00:36:07 INFO mapred.JobClient: Running job: job_201301162205_0006
    13/01/17 00:36:08 INFO mapred.JobClient:  map 0% reduce 0%
    13/01/17 00:36:14 INFO mapred.JobClient:  map 100% reduce 0%
    13/01/17 00:36:22 INFO mapred.JobClient:  map 100% reduce 33%
    13/01/17 00:36:24 INFO mapred.JobClient:  map 100% reduce 100%
    13/01/17 00:36:25 INFO mapred.JobClient: Job complete: job_201301162205_0006
    13/01/17 00:36:25 INFO mapred.JobClient: Counters: 29
    13/01/17 00:36:25 INFO mapred.JobClient:   Job Counters
    13/01/17 00:36:25 INFO mapred.JobClient:     Launched reduce tasks=1
    13/01/17 00:36:25 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=6863
    13/01/17 00:36:25 INFO mapred.JobClient:     Total time spent by all reduces waiting after reserving slots (ms)=0
    13/01/17 00:36:25 INFO mapred.JobClient:     Total time spent by all maps waiting after reserving slots (ms)=0
    13/01/17 00:36:25 INFO mapred.JobClient:     Launched map tasks=1
    13/01/17 00:36:25 INFO mapred.JobClient:     Data-local map tasks=1
    13/01/17 00:36:25 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=9207
    13/01/17 00:36:25 INFO mapred.JobClient:   File Output Format Counters
    13/01/17 00:36:25 INFO mapred.JobClient:     Bytes Written=78
    13/01/17 00:36:25 INFO mapred.JobClient:   FileSystemCounters
    13/01/17 00:36:25 INFO mapred.JobClient:     FILE_BYTES_READ=128
    13/01/17 00:36:25 INFO mapred.JobClient:     HDFS_BYTES_READ=170
    13/01/17 00:36:25 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=48059
    13/01/17 00:36:25 INFO mapred.JobClient:     HDFS_BYTES_WRITTEN=78
    13/01/17 00:36:25 INFO mapred.JobClient:   File Input Format Counters
    13/01/17 00:36:25 INFO mapred.JobClient:     Bytes Read=62
    13/01/17 00:36:25 INFO mapred.JobClient:   Map-Reduce Framework
    13/01/17 00:36:25 INFO mapred.JobClient:     Map output materialized bytes=128
    13/01/17 00:36:25 INFO mapred.JobClient:     Map input records=5
    13/01/17 00:36:25 INFO mapred.JobClient:     Reduce shuffle bytes=128
    13/01/17 00:36:25 INFO mapred.JobClient:     Spilled Records=22
    13/01/17 00:36:25 INFO mapred.JobClient:     Map output bytes=110
    13/01/17 00:36:25 INFO mapred.JobClient:     CPU time spent (ms)=1650
    13/01/17 00:36:25 INFO mapred.JobClient:     Total committed heap usage (bytes)
    13/01/17 00:36:25 INFO mapred.JobClient:     Combine input records=12
    13/01/17 00:36:25 INFO mapred.JobClient:     SPLIT_RAW_BYTES=108
    13/01/17 00:36:25 INFO mapred.JobClient:     Reduce input records=11
    13/01/17 00:36:25 INFO mapred.JobClient:     Reduce input groups=11
    13/01/17 00:36:25 INFO mapred.JobClient:     Combine output records=11
    13/01/17 00:36:25 INFO mapred.JobClient:     Physical memory (bytes) snapshot
    13/01/17 00:36:25 INFO mapred.JobClient:     Reduce output records=11
    13/01/17 00:36:25 INFO mapred.JobClient:     Virtual memory (bytes) snapshot=756244480
    13/01/17 00:36:25 INFO mapred.JobClient:     Map output records=12
    [root@localhost hadoop-1.1.1]#

5. View the results

The wordcount program counts how often each word occurs in the input files and writes the result to /hadoop/output/part-r-00000.

    [root@localhost hadoop-1.1.1]# bin/hadoop dfs -ls /hadoop/output
    Found 3 items
    -rw-r--r--   1 root supergroup          0 2013-01-17 00:36 /hadoop/output/_SUCCESS
    drwxr-xr-x   - root supergroup          0 2013-01-17 00:36 /hadoop/output/_logs
    -rw-r--r--   1 root supergroup         78 2013-01-17 00:36 /hadoop/output
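The counters in the job log can be cross-checked against the test file from step 2 without a cluster. The following is a minimal local sketch, not the Hadoop implementation. It assumes the example job splits lines on whitespace, treats words as case-sensitive, and writes one tab-separated "word, count" line per word in byte order; the names `TEST_TXT`, `word_count` and `format_output` are made up for this illustration.

```python
from collections import Counter

# Contents of test.txt as typed in step 2 (five lines, each ending in a newline).
TEST_TXT = (
    "hello hadoop\n"
    "hello World\n"
    "Hello Java\n"
    "Hey man\n"
    "i am a programmer\n"
)


def word_count(text):
    """Count whitespace-separated, case-sensitive tokens (assumed tokenisation)."""
    return Counter(text.split())


def format_output(counts):
    """Render one 'word<TAB>count' line per word, ordered by the word's bytes."""
    words = sorted(counts, key=lambda w: w.encode("utf-8"))
    return "".join(f"{w}\t{counts[w]}\n" for w in words)


counts = word_count(TEST_TXT)
output = format_output(counts)

print(output, end="")
print("input bytes:   ", len(TEST_TXT.encode("utf-8")))  # compare with Bytes Read
print("tokens:        ", sum(counts.values()))           # compare with Map output records
print("distinct words:", len(counts))                    # compare with Reduce output records
print("output bytes:  ", len(output.encode("utf-8")))    # compare with Bytes Written
```

Under these assumptions the sketch reports 62 input bytes, 12 tokens, 11 distinct words and 78 output bytes, which agrees with Bytes Read=62, Map output records=12, Reduce output records=11 and Bytes Written=78 in the log. Only "hello" is counted twice, because "Hello" with a capital letter is a different word.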
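Step 4 points out that the job fails if /hadoop/output already exists. One way to catch this before submitting the job is a small pre-flight check, sketched below. It assumes that `bin/hadoop dfs -test -e <path>` exits with status 0 when the path exists, and that the script is started from the hadoop-1.1.1 directory; the helper names `hdfs_path_exists`, `wordcount_command` and `run_wordcount` are hypothetical, and the `run` parameter exists only so the logic can be exercised without a cluster.

```python
import subprocess


def hdfs_path_exists(path, hadoop_bin="bin/hadoop", run=subprocess.call):
    """Return True if `path` exists in HDFS.

    Assumption: `hadoop dfs -test -e <path>` exits with status 0 if the path
    exists and with a non-zero status otherwise.
    """
    return run([hadoop_bin, "dfs", "-test", "-e", path]) == 0


def wordcount_command(input_glob, output_dir, hadoop_bin="bin/hadoop",
                      examples_jar="hadoop-examples-1.1.1.jar"):
    """Build the argument list for the wordcount invocation shown in step 4."""
    return [hadoop_bin, "jar", examples_jar, "wordcount", input_glob, output_dir]


def run_wordcount(input_glob, output_dir, run=subprocess.call):
    """Refuse to submit the job when the output directory is already present."""
    if hdfs_path_exists(output_dir, run=run):
        raise FileExistsError(
            f"Output directory {output_dir} already exists; "
            "remove it or choose another output path"
        )
    # The glob is passed through unexpanded; Hadoop resolves it against HDFS.
    return run(wordcount_command(input_glob, output_dir))
```

Calling `run_wordcount("/hadoop/input/*", "/hadoop/output")` would then either raise immediately or return the exit status of the job. The check deliberately does not delete an existing directory, in keeping with the point that results are not overwritten by default.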
