Skip to content

Commit

Permalink
update notes in hadoop
Browse files Browse the repository at this point in the history
  • Loading branch information
linhsolar committed Dec 22, 2023
1 parent 5226a96 commit fc1490c
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions tutorials/hadoop/hadoop-hive.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@
The goal of this tutorial is to examine some features of the Hadoop software system, mainly with HDFS, YARN and Hive.

## 1. The Hadoop system for the tutorial

You can setup a Hadoop system by yourself or use our setup for practices.
> Check [the document of Hadoop](https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-common/SingleCluster.html)
We have setup a hadoop system for testing using [Google DataProc](https://cloud.google.com/dataproc/).
Expand All @@ -15,8 +16,13 @@ In order to access the the system for the tutorial, we have open ssh connections
Make sure that you have **ssh** installed in your machine.
>We will have only 1 account for all participants so DO NOT change the system configuration and account information.
Some important ports for check:
- **9870**: for Hadoop NameNode management. For example, goto http://localhost:9870 to check NameNode
- **8088**: for Resource Manager UI. For example, goto http://localhost:8088/to check the cluster
-
## 2. Exercises
### Access the system

```
ssh [USER_NAME]@[MASTER_IP]
```
Expand Down

0 comments on commit fc1490c

Please sign in to comment.