Commit ffc8728

init Hadoop single node setup
1 parent 9c9b12f commit ffc8728

3 files changed: +17 -6 lines changed

.gitignore

+2 -1
@@ -1,3 +1,4 @@
 *.pyc
+.DS_Store
 hb/
-.DS_Store
+tmp/

.mrjob.conf

+2 -4
@@ -4,10 +4,8 @@
 "base_tmp_dir": "./tmp/"
 },
 "hadoop": {
-"python_archives": ["$BT/aws/src-tree.tar.gz"],
-"upload_files": ["$BT/config/config.py", "$BT/config/secret.py"],
-"base_tmp_dir": "/scratch/$USER",
-"setup_cmds": ["ln -sf $(readlink -f config.py) src-tree.tar.gz/config/config.py", "ln -sf $(readlink -f secret.py) src-tree.tar.gz/config/secret.py", "cd src-tree.tar.gz; make -f Makefile.emr"]
+"hadoop_home" : "/Users/michau/hadoop-1.0.4",
+"base_tmp_dir": "/Users/michau/tmp/",
 },
 "emr": {
 "s3_scratch_uri": "s3://walrus/tmp/",
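
The new `hadoop` section hard-codes absolute paths that exist only on the committer's machine. A minimal sketch of a pre-launch check follows; it is not part of this commit. The keys `hadoop_home` and `base_tmp_dir` come from the diff above, while the helper name `missing_paths` is hypothetical.

```python
import os


def missing_paths(runner_opts, keys=("hadoop_home", "base_tmp_dir")):
    """Return (key, path) pairs whose path is set but does not exist locally."""
    missing = []
    for key in keys:
        path = runner_opts.get(key)
        if path is not None and not os.path.exists(os.path.expanduser(path)):
            missing.append((key, path))
    return missing


# The "hadoop" section as added by this commit (values copied from the diff).
hadoop_opts = {
    "hadoop_home": "/Users/michau/hadoop-1.0.4",
    "base_tmp_dir": "/Users/michau/tmp/",
}

if __name__ == "__main__":
    for key, path in missing_paths(hadoop_opts):
        print("%s points at a missing path: %s" % (key, path))
```

Running it on another machine lists the entries to adjust before copying `.mrjob.conf` to the home directory.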

README.md

+13 -1
@@ -85,7 +85,19 @@ In order to query the mrlin datastore in HBase, use the [`mrlin query`](https://
 
 * setup in virtual env: `source hb/bin/activate` then `pip install mrjob`
 * `cp .mrjob.conf ~` before launch
-* run `python mrlin_mr.py README.md`
+* `source hb/bin/activate`
+* run `python mrlin_mr.py README.md` for standalone
+* set up [Hadoop 1.0.4](http://ftp.heanet.ie/mirrors/www.apache.org/dist/hadoop/common/hadoop-1.0.4/hadoop-1.0.4.tar.gz) - if unsure follow a [single-node setup](http://orzota.com/blog/single-node-hadoop-setup-2/) tutorial
+* `cp .mrjob.conf ~` before launch if you change settings (!)
+* note all changes that were necessary in ` conf/core-site.xml`, `conf/mapred-site.xml`, `conf/hdfs-site.xml`, and `hadoop-env.sh` (provide examples)
+* run `python mrlin_mr.py -r hadoop README.md` for local Hadoop
+
+
+#### Debug
+
+* `tail -f hadoop-michau-namenode-Michael-Hausenblas-iMac.local.log`
+
+
 
 ## License
 
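
The README addition leaves the Hadoop configuration examples as a to-do. As a rough sketch, the snippet below renders the three site files with the stock pseudo-distributed values from the Hadoop 1.x single-node documentation. These values are an assumption, not the settings used in this commit, and the names `ASSUMED_SITE_CONFIG` and `render_site_xml` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed values: the stock pseudo-distributed settings for Hadoop 1.x,
# not the values actually used by the author of this commit.
ASSUMED_SITE_CONFIG = {
    "conf/core-site.xml": {"fs.default.name": "hdfs://localhost:9000"},
    "conf/hdfs-site.xml": {"dfs.replication": "1"},
    "conf/mapred-site.xml": {"mapred.job.tracker": "localhost:9001"},
}


def render_site_xml(properties):
    """Render a dict of properties as a Hadoop <configuration> document."""
    root = ET.Element("configuration")
    for name, value in sorted(properties.items()):
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    for path, props in sorted(ASSUMED_SITE_CONFIG.items()):
        print("# " + path)
        print(render_site_xml(props))
```

`hadoop-env.sh` is a shell script rather than XML, so it is not covered here; it typically needs `JAVA_HOME` set to a machine-specific path.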