PRACTICAL NO.
4
Aim: Installation of HIVE & Performing basic commands in Cloudera
Installation of Cloudera in VirtualBox Steps
1. Download Cloudera QuickStart VM:
o Visit the Cloudera website and download the Cloudera QuickStart VM for
VirtualBox (Cloudera may have discontinued QuickStart VM, so if it's not
available, use other Hadoop distributions or cloud-based solutions).
2. Install VirtualBox:
o Download and install VirtualBox from VirtualBox's official site.
3. Import the Cloudera VM into VirtualBox:
o Open VirtualBox.
o Click on "File" > "Import Appliance."
o Browse and select the downloaded Cloudera QuickStart VM .ovf file.
o Click "Next" and then "Import" to import the VM.
4. Configure VM Settings :
o Select the imported VM.
o Click on "Settings" to modify memory, CPU, and other settings as per your
system's capabilities.
5. Start the VM:
o Select the Cloudera QuickStart VM in VirtualBox.
o Click "Start" to boot up the VM.
o Wait for the VM to start up, and the Cloudera Manager interface should
appear.
Basic HDFS Commands:
1. Create a Directory (mkdir):
[hdfs dfs -mkdir /user/hive/warehouse/mydirectory]
2. List Directory Contents (ls):
[hdfs dfs -ls /user/hive/warehouse/]
3. Copy File from Local to HDFS (copyFromLocal):
[hdfs dfs -copyFromLocal /path/to/localfile /user/hive/warehouse/]
4. Create an Empty File (touchz):
[hdfs dfs -touchz /user/hive/warehouse/emptyfile.txt]
5. Display File Contents (cat):
[hdfs dfs -cat /user/hive/warehouse/filename.txt]
6. Upload File to HDFS (put):
[hdfs dfs -put /path/to/localfile /user/hive/warehouse/]
7.Download File from HDFS (get):
[hdfs dfs -get /user/hive/warehouse/filename.txt /path/to/local/]