Cloudera Vs Hortonworks

  • jenny 16289

    Valued Member

    Points: 59

    Comments posted to this topic are about the item Cloudera Vs Hortonworks

  • Michael Meierruth

    SSCrazy Eights

    Points: 9991

    Can you name some good and concise web sites or books on this hadoop technology that you would recomend.

  • jenny 16289

    Valued Member

    Points: 59

    You can go through following websites:

    http://www.greycampus.com/opencampus/hadoop-administrator

    http://www.tutorialspoint.com/hadoop/[/url]

  • ianstirk

    Ten Centuries

    Points: 1310

    Hi, for books, see this discussion : http://www.sqlservercentral.com/Forums/Topic1724468-3728-1.aspx

    thanks

    Ian

  • GShenanigan

    SSC Veteran

    Points: 213

    Hortonworks also offer a Sandbox, which is a free VM-based download that allows you to play about with the Hadoop ecosystem and learn the basics. I'd thoroughly recommend downloading that if you're interested in learning more.

    http://hortonworks.com/products/hortonworks-sandbox/

  • nfeuerstine

    Valued Member

    Points: 51

  • Paul Hernández

    SSCarpal Tunnel

    Points: 4880

    I think as part of the digital transformation, enterprise big data providers must include the customer feedback in their business models and product development process, and they already do that, at least Horton Works. That means, it is very important the chain from customers to either, Hortonworks and Cloudera, and finally the Apache development community.

    The key aspect is who can deliver what the customer really need or demand: Integrating them into the innovation process.

    I think is an error in the business model if someone try to replace the classic enterprise data warehouses. Better for me is to create additional value based on innovation, Lock-in of customers and partners, complementary products and services.

    Apart from that, one could try to capture new customers that never use business intelligence to support their decisions, and bring them into the world of advance analytics.

    Paul Hernández
  • xsevensinzx

    One Orange Chip

    Points: 25531

    Paul Hernández (10/20/2015)


    I think is an error in the business model if someone try to replace the classic enterprise data warehouses. Better for me is to create additional value based on innovation, Lock-in of customers and partners, complementary products and services.

    Apart from that, one could try to capture new customers that never use business intelligence to support their decisions, and bring them into the world of advance analytics.

    What do you mean by that?

    None of the vendors are trying to replace the Enterprise Data Warehouse. Very few people are trying to use Hadoop to do that either. The cost per terabyte is a lot cheaper in Hadoop versus SQL Server and there is distributed processing. Most people are trying to augment ETL and acquire fault-tolerant storage (that's cheaper) with Hadoop as the data source that feeds the EDW.

  • Paul Hernández

    SSCarpal Tunnel

    Points: 4880

    Hi,

    I read this in the article:

    Today, many have outlined the differentiating line as to how Cloudera and Hortonwork look at data warehouses, asserting that Hortonworks is willing to support the present data warehouse storage whereas Cloudera wants to cancel it completely

    I just overlooked the word "storage" and understood that Couldera wants to cancel the present data warehouse completely.

    Sorry for the confusion but my comment is still valid for the discussion Big Data vs. DW

    Paul Hernández
  • David Benoit

    SSC-Dedicated

    Points: 34562

    A key consideration in this space is where the code contributors are aligned, as that is a major factor when looking at any open source solution. All of the 3 major players, HortonWorks, Cloudera, and MapR, have value add, and all have areas of improvement. Understanding which vendor is helping the key contributors to developing the ecosystem is really critical when making your choice.

    http://hadoop.apache.org/who.html#Hadoop+Committers

    http://hortonworks.com/blog/reality-check-contributions-to-apache-hadoop/

    At least worthy of consideration....

    David

    @SQLTentmaker

    “He is no fool who gives what he cannot keep to gain that which he cannot lose” - Jim Elliot

  • Paul Hernández

    SSCarpal Tunnel

    Points: 4880

    Hi David, thanks for the explanation.

    It is what I wanted to say but in a understandable way.

    Paul Hernández
  • David Benoit

    SSC-Dedicated

    Points: 34562

    Paul Hernández (10/20/2015)


    Hi David, thanks for the explanation.

    It is what I wanted to say but in a understandable way.

    Thanks, Paul. I just wanted to provide some additional "color" to what you were stating by adding the links. 🙂

    David

    @SQLTentmaker

    “He is no fool who gives what he cannot keep to gain that which he cannot lose” - Jim Elliot

  • xsevensinzx

    One Orange Chip

    Points: 25531

    Paul Hernández (10/20/2015)


    Hi,

    I read this in the article:

    Today, many have outlined the differentiating line as to how Cloudera and Hortonwork look at data warehouses, asserting that Hortonworks is willing to support the present data warehouse storage whereas Cloudera wants to cancel it completely

    I just overlooked the word "storage" and understood that Couldera wants to cancel the present data warehouse completely.

    Sorry for the confusion but my comment is still valid for the discussion Big Data vs. DW

    Oh, I must have misread that too.

    I think the author of the article is misinterpreting Cloudera use of Enterprise Data Hub (EDH) in replace of Enterprise Data Warehouse (EDW).

    Cloudera has repeatedly said they are not trying to position themselves to replace the Enterprise Data Warehouse. They want to be a Enterprise Data Hub (EDH) that stands beside your data warehouse. It's shown through this graphic they have on their site that emphasizes their position:

    That being said, Cloudera is still Hadoop. It's HDFS, Impala, Hive and HBase. None of these technologies are designed to replace the data warehouse. They are designed to approach a diverse set of data problems that maybe SQL Server, MySQL, Oracle or even PostgreSQL don't handle that well.

  • mgrabenstein

    Grasshopper

    Points: 11

    Minor, author notes: ... there exists the concern of having financial support, which any pre-IPO start-ups are required to have in order to win over enterprise consumers. ....

    Cloudera is pre-IPO.

    Hortonworks is publicly traded: NASDAQ: HDP

    Currently $21.40 a share

  • marcus 10260

    SSC Rookie

    Points: 32

    Another point to note is that HortonWorks contributes code (a lot of code) to the Apache projects that make up the Hadoop system and has continuously done so and has something like 20 developers dedicated to the Apache Foundation projects which this utilises.

    Cloudera has barely contributed any and I can't remember the stat's on how many dev's they have but it's less than 5 off memory.

    Basically Cloudera leverages off the work that others put in, package some extra stuff & then state they are the best.

Viewing 15 posts - 1 through 15 (of 20 total)

You must be logged in to reply to this topic. Login to reply