Some hadoop commands

Jelly小于 1 分钟HadoopHadoop

Hadoop相关命令的使用

hadoop系统可继承一定的Linux系统命令, 以命令 ls 为例, 在hadoop系统中则需使用 hadoop dfs -ls.

特别的, hadoop有一对专门用于上传下载的命令, hadoop dfs -puthadoop dfs -get.

创建一个 speech.txt 文件, 其内容为

There is no mountain we cannot climb
There is no summit we cannot reach
There is no challenge we cannot meet
There is no victory we cannot have
We will not bend
We will not break
We will not yield
We will never give in
We will never give up
And we will never ever back down

我将其放在了hadoop目录下 ~/hadoop/speech.txt

将其传到hadoop系统的根目录下 /

hdfs dfs -put speech.txt /

使用hadoop官方示例 hadoop-mapreduce-examples-*.*.*.jar 统计 speech.txt 的单词数量

hadoop jar ~/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.2.4.jar wordcount /speech.txt /speechCount

查看统计结果

hdfs dfs -cat /speechCount/part-r-00000

webUI

我这里使用了docker的端口映射, 将9870端口映射到了我的windows宿主机上的19870端口.