Hadoop Cluster Access
This document describes how to get access to the Hadoop cluster for the course.
Prerequisites
You will need:
- An account with FutureSystems
- Membership in FutureSystems project 475
- An ssh key uploaded to the portal
Access
The cluster frontend is located at <IP_ADDRESS> (the frontend IP).
Log in using ssh:
ssh -i $PATH_TO_SSH_PRIVATE_KEY $PORTAL_USERNAME@$HADOOP_IP
In the above:
- $PATH_TO_SSH_PRIVATE_KEY is the location of the private key whose matching public key has been added to the FutureSystems portal (ssh -i expects the private key file, not the public one)
- $PORTAL_USERNAME is the username on the FutureSystems portal
- $HADOOP_IP is the IP address of the Hadoop frontend node
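As a minimal sketch of how the pieces of the login command fit together, the snippet below assembles the command from three values and prints it for review rather than connecting. The variable names, the key path, and the username "albert" are hypothetical placeholders, not values from this course; the frontend address is left as given above.

```shell
# Hypothetical example values; replace them with your own.
key_file="$HOME/.ssh/id_rsa"   # identity (private key) file passed to ssh -i
portal_user="albert"           # hypothetical portal username
frontend="<IP_ADDRESS>"        # frontend address, as given above

# Print the login command so it can be checked before it is run.
login_command() {
    printf 'ssh -i %s %s@%s\n' "$1" "$2" "$3"
}

login_command "$key_file" "$portal_user" "$frontend"
```

Once the printed command looks right, it can be copied and run directly.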
Usage
Hadoop is installed under /opt/hadoop, and you can refer to this location
using $HADOOP_HOME.
For more details, run
hadoop fs
to print the usage of the file system shell, and
hadoop jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples*.jar
to list the available example programs.
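The two commands above can be combined into a short session. The following is a sketch, not a prescribed workflow: it copies a local text file into HDFS, runs the wordcount program from the examples jar, and prints the result. The function names, the DRY_RUN switch, and the directory and file names are hypothetical; the fallback to /opt/hadoop assumes the install location stated above.

```shell
# Assumption: Hadoop lives under /opt/hadoop unless HADOOP_HOME says otherwise.
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# wordcount_demo INPUT_DIR OUTPUT_DIR LOCAL_FILE
#   INPUT_DIR   hypothetical HDFS directory for the input
#   OUTPUT_DIR  hypothetical HDFS directory for the result; must not exist yet
#   LOCAL_FILE  hypothetical local text file to count words in
wordcount_demo() {
    input_dir="$1"
    output_dir="$2"
    local_file="$3"

    # Create the input directory in HDFS and upload the file.
    run hadoop fs -mkdir -p "$input_dir"
    run hadoop fs -put "$local_file" "$input_dir"

    # Run the wordcount example shipped in the examples jar.
    run hadoop jar "$HADOOP_HOME"/share/hadoop/mapreduce/hadoop-mapreduce-examples*.jar \
        wordcount "$input_dir" "$output_dir"

    # Print the reducer output; the glob is quoted so that hadoop expands it.
    run hadoop fs -cat "$output_dir/part-r-*"
}

# Example: DRY_RUN=1 wordcount_demo wc/in wc/out notes.txt
```

Setting DRY_RUN=1 prints the four commands without contacting the cluster, which is a convenient way to check the paths first.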