Program Arguments: 'nodes' 'edges' 'output_folder' 'number_of_iterations'
Example: "nodes.txt edges.txt output 5"
Steps to run on EC2:
- Log in to the cluster
- Send files from local machine to remote cluster
- (Optional) Download the Twitter dataset with wget http://socialcomputing.asu.edu/uploads/1296759055/Twitter-dataset.zip and unzip it.
- Copy your nodes and edges datasets to HDFS
- To run (example), type: hadoop jar PageRank.jar nodes.txt edges.txt output 5
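The last two steps above (copy to HDFS, then run) can be sketched as a small script to execute on the cluster once the files are there. This is only a sketch under assumptions: the run helper, the pagerank_steps function and the DRY_RUN switch are hypothetical names introduced here, and the datasets are assumed to go into your HDFS home directory (the "." destination), which may not match how your cluster is set up. By default it only prints the commands so you can check them first.

```shell
#!/bin/sh
# Sketch only: run on the cluster after the jar and datasets have been sent over.
# DRY_RUN=1 (the default here) prints each command; set DRY_RUN=0 to execute them.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Hypothetical wrapper taking the four program arguments in order:
# nodes, edges, output_folder, number_of_iterations.
pagerank_steps() {
    nodes="$1"; edges="$2"; output="$3"; iterations="$4"
    # Assumption: datasets are copied into the user's HDFS home directory (".").
    run hadoop fs -put "$nodes" "$edges" .
    # Launch the job exactly as in the example above.
    run hadoop jar PageRank.jar "$nodes" "$edges" "$output" "$iterations"
    # List the output folder once the job has finished.
    run hadoop fs -ls "$output"
}

pagerank_steps nodes.txt edges.txt output 5
```

Once the printed commands look right for your cluster, run the script again with DRY_RUN=0 to execute them.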