HDFS basic operations and configurations.
Commands:
$> hadoop version // show the Hadoop version.
$> hadoop fs // hdfs
$> hadoop jar
/***** File *****/
// `hdfs dfs` is equivalent to `hadoop fs`
$> hdfs dfs -mkdir -p /usr/centos/hadoop // create a dir on hdfs.
$> hdfs dfs -ls -R / // list dirs on hdfs recursively.
$> hdfs dfs -put <localfile> <hdfs dir> // upload a file to hdfs.
$> hdfs dfs -rm <remote file> // remove a file on hdfs; use `-rm -r` for a dir.
$> hdfs dfs -appendToFile <localfile> <hdfsfile> // append a local file to a file on hdfs.
$> hdfs dfs -cat <hdfsfile> // `cat` a file on hdfs.
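When these operations need to be scripted, one option is to build the argument lists in Python and hand them to `subprocess`. The sketch below only builds and prints the commands. The helper name `hdfs_dfs` and the file name `local.txt` are made up for illustration, and actually running the commands assumes a machine with Hadoop on the `PATH`.

```python
import shlex


def hdfs_dfs(*args):
    """Return the argv list for an `hdfs dfs` call without running it."""
    return ["hdfs", "dfs", *args]


# The same kinds of operations as in the command list, as argv lists.
commands = [
    hdfs_dfs("-mkdir", "-p", "/usr/centos/hadoop"),        # create a directory tree
    hdfs_dfs("-ls", "-R", "/"),                            # recursive listing
    hdfs_dfs("-put", "local.txt", "/usr/centos/hadoop"),   # upload a placeholder file
    hdfs_dfs("-cat", "/usr/centos/hadoop/local.txt"),      # print the file contents
    hdfs_dfs("-rm", "-r", "/usr/centos/hadoop"),           # remove the directory recursively
]

for argv in commands:
    # Print the shell form; to execute instead, import subprocess and call
    # subprocess.run(argv, check=True) on a host where Hadoop is installed.
    print(shlex.join(argv))
```

Passing a list rather than a single string avoids shell quoting problems when a path contains spaces.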
Block storage
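- Block size
  - Disk seek time: 10 ms
  - Disk transfer rate: 100 MB/s
  - seek time : transfer time = 1 : 100, so one block should take about 1 s to transfer, which is about 100 MB
  - Block size: 128 MB (100 MB rounded up to a power of two; the default in Hadoop 2.x)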
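The reasoning behind the 128 MB figure can be written out as a small calculation. This is only a sketch of the arithmetic in the list above; the function name `block_size_mb` and the step of rounding up to a power of two are illustrative assumptions, not something Hadoop computes at runtime.

```python
def block_size_mb(seek_ms=10, rate_mb_per_s=100, ratio=100):
    """Return (raw_mb, rounded_mb) for the given disk characteristics.

    ratio is the desired transfer time divided by the seek time.
    """
    transfer_ms = seek_ms * ratio                  # time one block should take to transfer
    raw_mb = transfer_ms * rate_mb_per_s / 1000    # data moved in that time
    rounded_mb = 1
    while rounded_mb < raw_mb:                     # round up to the next power of two
        rounded_mb *= 2
    return raw_mb, rounded_mb


raw, rounded = block_size_mb()
print(f"raw: {raw} MB, rounded: {rounded} MB")
```

With the numbers above, the raw value is 100 MB and the rounded value is 128 MB. A faster disk shifts the result: at 200 MB/s the same ratio gives 200 MB, which rounds up to 256 MB.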
Config tmp directory
- Show all config items for each module
Decompress hadoop-2.7.4.tar.gz
Get the files share/hadoop/[common | hdfs | mapreduce | yarn]/hadoop-xxxx-2.7.4.jar
(mapreduce -> hadoop-mapreduce-client-core-2.7.4.jar)
Decompress that file and get xxxx-default.xml
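Because a jar is an ordinary zip archive, the last step can also be done without unpacking anything to disk. The following is a minimal sketch using Python's standard `zipfile` module; the helper names `list_default_xml` and `read_entry` are hypothetical, and the jar paths are whatever you pass on the command line.

```python
import sys
import zipfile


def list_default_xml(jar_path):
    """Return the names of all *-default.xml entries inside a jar."""
    with zipfile.ZipFile(jar_path) as jar:
        return [name for name in jar.namelist() if name.endswith("-default.xml")]


def read_entry(jar_path, entry_name):
    """Return the text of one entry inside a jar, decoded as UTF-8."""
    with zipfile.ZipFile(jar_path) as jar:
        return jar.read(entry_name).decode("utf-8")


if __name__ == "__main__":
    # Each command-line argument is treated as a path to a jar file.
    for jar_path in sys.argv[1:]:
        for name in list_default_xml(jar_path):
            print(jar_path, "->", name)
```

Running the script with one of the jars from the step above as its argument prints the default config file it contains, and `read_entry` returns the XML text for inspection.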
core-site.xml
hadoop.tmp.dir is /home/centos01/hadoop
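Hadoop site files use a `<configuration>` root holding `<property>` elements, each with a `<name>` and a `<value>`. As a rough sketch, the setting above could be generated like this; the helper name `site_xml` is an assumption for illustration, and the output still has to be saved as core-site.xml in the Hadoop config directory by hand.

```python
import xml.etree.ElementTree as ET


def site_xml(props):
    """Build the text of a Hadoop *-site.xml file from a {name: value} mapping."""
    root = ET.Element("configuration")
    for name, value in props.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    if hasattr(ET, "indent"):   # pretty-printing is available from Python 3.9
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


print(site_xml({"hadoop.tmp.dir": "/home/centos01/hadoop"}))
```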
hdfs-site.xml
dfs.namenode.name.dir is file://${hadoop.tmp.dir}/dfs/namenode // name node
dfs.datanode.data.dir is file://${hadoop.tmp.dir}/dfs/data // data node
dfs.namenode.checkpoint.dir is file://${hadoop.tmp.dir}/dfs/namesecondary // secondary name node
$> xsync hdfs-site.xml
$> hdfs namenode -format // format namenode only
$> start-dfs.sh // start hdfs
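All three directories hang off `hadoop.tmp.dir` through `${...}` substitution, so changing that one value moves them together. The sketch below imitates the substitution with a single regular-expression pass to show the resulting paths. It is a simplification for illustration, not Hadoop's own resolver, and the `resolve` helper is hypothetical.

```python
import re

VAR = re.compile(r"\$\{([^}]+)\}")


def resolve(value, props):
    """Expand ${name} references in value; unknown names are left untouched."""
    return VAR.sub(lambda m: props.get(m.group(1), m.group(0)), value)


core_site = {"hadoop.tmp.dir": "/home/centos01/hadoop"}

# dfs.datanode.data.dir is the DataNode storage property in hdfs-default.xml.
hdfs_site = {
    "dfs.namenode.name.dir": "file://${hadoop.tmp.dir}/dfs/namenode",
    "dfs.datanode.data.dir": "file://${hadoop.tmp.dir}/dfs/data",
    "dfs.namenode.checkpoint.dir": "file://${hadoop.tmp.dir}/dfs/namesecondary",
}

for name, value in hdfs_site.items():
    print(name, "=", resolve(value, core_site))
```

With the tmp directory set as above, the data node directory resolves to `file:///home/centos01/hadoop/dfs/data`.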