A Beginners Look at Hadoop


Dave Poole
SSC-Insane (23K reputation)

Group: General Forum Members
Points: 23966 Visits: 3482
Comments posted to this topic are about the item A Beginners Look at Hadoop

LinkedIn Profile
www.simple-talk.com
paul s-306273
SSCarpal Tunnel (4.3K reputation)

Group: General Forum Members
Points: 4318 Visits: 1190
Nice article, but I doubt I have the confidence to try this at home...
Dave Poole
SSC-Insane (23K reputation)

Group: General Forum Members
Points: 23966 Visits: 3482
paul s-306273 (6/5/2013)
Nice article, but I doubt I have the confidence to try this at home...


With VirtualBox and Ubuntu it's pretty easy to get started with Linux.
Using the Cloudera image is also very easy.

If you cock it up, all you will lose is a bit of disk space until such time as you delete the virtual image. Well, maybe a couple of evenings as well!

The only reason I haven't got it running on my home PC is that I've got an ancient 32-bit PC with 3GB RAM running Windows XP. It cannot run the Cloudera versions after CDH3, and the versions from CDH4.1 onwards are the ones where you have a GUI to play with.

Have a look at the SQLBits website for Justin Langford's presentation on HDInsight. Again, it's very easy to install this on a Windows box and get started. The idea is that you have a local instance even though it is just one node. It's good enough to try the mechanical bits and bobs even if you haven't got the benefit of the full map-reduce.

LinkedIn Profile
www.simple-talk.com
paul s-306273
SSCarpal Tunnel (4.3K reputation)

Group: General Forum Members
Points: 4318 Visits: 1190
Okay - I'll try that when I have some free evenings.

Thanks David.
Ross McMicken
SSCommitted (1.6K reputation)

Group: General Forum Members
Points: 1573 Visits: 2253
You might also want to take a look at Splunk. I've heard good things about its ability to analyze gigabytes of data quickly. http://www.splunk.com/?r=header
Paul Brewer
SSC Eights! (887 reputation)

Group: General Forum Members
Points: 887 Visits: 1328
Great article, the first I've read that clearly and simply explains what Hadoop is. Thanks
gclausen
Valued Member (68 reputation)

Group: General Forum Members
Points: 68 Visits: 264
Great article!! Do you think it is worth learning a little bit of Java for this?
alen teplitsky
SSChampion (11K reputation)

Group: General Forum Members
Points: 11058 Visits: 4674
I'm looking at this as well. I've had it with full-text indexing for a security log analysis solution I built up over the years. I'm looking at Analysis Services and Hadoop.

I'm playing with SSAS for now and will try Hadoop later.
Greg_Della-Croce
Grasshopper (12 reputation)

Group: General Forum Members
Points: 12 Visits: 9
I enjoyed the article. Like others I have been in the Windows world for a very long time, and Linux is something of a new adventure for me. I just finished, with a lot of help from my friends, a RedHat 17 based NAS for my home network.

I am interested in the "WHAT" of Hadoop. As in what would it be used for in business? Would you or someone give some hard examples of uses?

GregDC
alex.d.garland
Mr or Mrs. 500 (527 reputation)

Group: General Forum Members
Points: 527 Visits: 448
David.Poole (6/5/2013)

Have a look at the SQLBits website for Justin Langford's presentation on HDInsight. Again, it's very easy to install this on a Windows box and get started. The idea is that you have a local instance even though it is just one node. It's good enough to try the mechanical bits and bobs even if you haven't got the benefit of the full map-reduce.


Hi David, I attended Justin's talk at SQLBits (a good introduction, I thought). I don't know if you were also there in person.

I had a quick word with him at the end and asked if there were any good walkthroughs or practical exercises for HDInsight newbies and he recommended this by Cindy Gross: http://blogs.msdn.com/b/cindygross/archive/2013/01/31/mash-up-hive-sql-server-data-in-powerpivot-amp-power-view-hurricane-sandy-2012.aspx

I haven't had a chance to work through it yet but may well do soon. Like yourself, I've recently got a Linux installation up and running (Fedora), but I suspect that the MS version will be a gentler learning curve to start off with.