03_Assignment_SemII_2025-2026
Opened: Wednesday, 25 March 2026, 12:00 AM
Due: Thursday, 2 April 2026, 11:59 PM
It is assumed you have a PC or laptop (running Windows or macOS) with at least 8 GB of RAM, 512 GB or more of hard disk storage, and an i5 processor or a comparable or better one from another manufacturer.
The GOAL is to prepare a Linux (Ubuntu flavour) environment for Hadoop installation.
a) Install Oracle VirtualBox on your Windows or macOS system.
b) Next, install the latest Ubuntu OS inside Oracle VirtualBox. The user account on your Ubuntu OS MUST be your given name.
c) Determine what Hadoop requires, prepare the environment accordingly, and confirm that it is ready for Hadoop installation.
d) Install Hadoop on your Ubuntu OS under your user account. Once done, verify that your Hadoop installation is working properly and is healthy.
e) If it is healthy, find the "NYC Taxi (Small Subset): ~366 MB (compressed)" dataset, load it into Hadoop, and identify appropriate tools to extract any insights of your choice from it while it is stored in the Hadoop environment.
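For step c), one possible way to confirm readiness is a small script run inside the Ubuntu guest. The sketch below is in Python and is only an illustration, not a required method. It assumes a single-node setup that needs a Java runtime plus ssh and sshd for the optional start/stop scripts. The command list, the 20 GB free-space threshold, and all function names are assumptions made for this example and should be adjusted to the Hadoop version you install.

```python
import os
import shutil

# Assumed prerequisite list for a single-node setup: a Java runtime for the
# Hadoop daemons, plus ssh/sshd for the optional start/stop scripts.
REQUIRED_COMMANDS = ("java", "ssh", "sshd")

# Illustrative threshold only; the assignment does not specify a value.
MIN_FREE_GB = 20.0


def missing_commands(commands=REQUIRED_COMMANDS, which=shutil.which):
    """Return the commands that cannot be found on PATH."""
    return [cmd for cmd in commands if which(cmd) is None]


def free_disk_gb(path="/"):
    """Return the free disk space at `path` in gigabytes (10**9 bytes)."""
    return shutil.disk_usage(path).free / 10**9


def readiness_problems(missing, free_gb, java_home, min_free_gb=MIN_FREE_GB):
    """Turn raw check results into a list of human-readable problems."""
    problems = [f"command not found: {cmd}" for cmd in missing]
    if free_gb < min_free_gb:
        problems.append(
            f"only {free_gb:.1f} GB free, want at least {min_free_gb:.1f} GB"
        )
    if not java_home:
        problems.append("JAVA_HOME is not set")
    return problems


if __name__ == "__main__":
    problems = readiness_problems(
        missing=missing_commands(),
        free_gb=free_disk_gb(),
        java_home=os.environ.get("JAVA_HOME"),
    )
    if problems:
        print("Not ready for Hadoop yet:")
        for problem in problems:
            print(" -", problem)
    else:
        print("Basic prerequisites look present.")
```

An empty problem list only means these basic checks passed. It does not prove that Hadoop will start, so the verification asked for in step d) is still needed.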
Note:
- You MUST document all processes you have gone through at each stage including screenshots with explanations.
- In your documentation, include the hurdles or challenges you experienced during all installations and how you overcame them.
Let's get our hands a bit dirty today.
Submission is 30th March 2026 at 23:59hrs