Friday, February 13, 2015

Hortonworks 2.2 (Sandbox)

I'm going to document some of my journey into the world of Hortonworks. I have worked in a Hadoop/Hive environment before and I am really impressed with new tools that have been added to that Apache stack.

I've just decided to download the Hortonworks Data Platform (HDP) version 2.2 (released Dec 2014) and play around with it. I have been planning on blogging about my experiences as a data scientist so I thought this might be a good time to start a blog and keep track of ideas and progress.

I downloaded the HDP 2.2 virtual environment for VirtualBox (VMWare and HyperV are also available) from

http://hortonworks.com/hdp/downloads/

And here's the install guide.

The image is 4.7 GB, about twice the size as the previous HDP 2.1. I'm not sure why it is so much larger, but they have added 4 (not even close to doubling) new tools.

After opening the VM, it shows instructions for two ways to interact with the Sandbox: through the browser

http://127.0.0.1:8888

and ssh

ssh root@127.0.0.1 -p 2222

I opened the browser (http://127.0.0.1:8888) and found that the ssh password is hadoop. I successfully logged in through ssh. I also followed the new link in the "Dive right in" section, which took me to the Sandbox landing page. The login is hue and the password is 1111, but I appear to be logged in upon arrival.

Now I'll start looking through the tutorials on the Hadoop for Data Scientists & Analysts page.

No comments:

Post a Comment