SQL Clone
SQLServerCentral is supported by Redgate
 
Log in  ::  Register  ::  Not logged in
 
 
 


One Database to Rule Them All


One Database to Rule Them All

Author
Message
Steve Jones
Steve Jones
SSC Guru
SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)SSC Guru (225K reputation)

Group: Administrators
Points: 225238 Visits: 19641
Comments posted to this topic are about the item One Database to Rule Them All

Follow me on Twitter: @way0utwest
Forum Etiquette: How to post data/code on a forum to get the best help
My Blog: www.voiceofthedba.com
Dave Poole
Dave Poole
SSC-Insane
SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)

Group: General Forum Members
Points: 24312 Visits: 3495
It's worth reading Nathan Marz's book Big Data. It's an easy read but as a MEAP there are a few glaring typos.

It goes into the Lamda architecture in depth and the though processes underpinning it.

The message is that it is not just the tech underpinning what Twitter do it is the manner in which they use it.

As the blog posts suggest, they are trying to solve old problems by coming up with new blends of existing solutions. What they are doing goes far beyond what SQL Server can do and yet the "One DB to rule them all" approach really is where SQL Server has been moving

  • Traditional Relational engine

  • Column store engine

  • Filestream

  • Hekaton engine

  • XML processing - Hopefully they will add JSON to this

  • Analysis services engine

  • Data reader destinations in SSIS

  • Data dependent routing



LinkedIn Profile
www.simple-talk.com
David In BC
David In BC
SSC-Enthusiastic
SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)SSC-Enthusiastic (142 reputation)

Group: General Forum Members
Points: 142 Visits: 376
I took an interesting one-day course from DataStax on Apache Cassandra - a big data solution used by Netflix, Comcast, eBay, and dozens of other companies.

Like what Twitter uses, it's designed to be easily scaled horizontally, and provides great performance, but with data-duplication and at the cost of consistency; you get eventual consistency, which is fine for social media, but not so hot for banks or financial institutions.

DataStax sells a commercial version (support, extra tools), so I was expecting some bias, but what struck me as amusing (and a little over the top) was the presenter calling RDBMSs "legacy" all day long.

Cobol is a legacy language - it's been superseded by languages that can do everything Cobol can do, and more.

Cassandra, and other big databases, can't do everything that SQL Server, Oracle, MySQL, etc, can do. In fact, big-data databases are kind of crappy at a lot of what an RDBMS can do with ease. I am confident Netflix doesn't use Cassandra as their billing database, for example. But maybe they use Cassandra to see if there is a correlation between the movies and tv-shows you watch and your credit card of choice.

Fortunately, both solutions play well with each other, just as an OLTP system integrates well with a data warehouse.
Dave Poole
Dave Poole
SSC-Insane
SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)

Group: General Forum Members
Points: 24312 Visits: 3495
but what struck me as amusing (and a little over the top) was the presenter calling RDBMSs "legacy" all day long.


I've found that the techy types at the NOSQL vendors are usually pretty honest about what their use case and what doesn't fit their use case.

The marketing types are another matter entirely. One marketing type extolled the virtues of their distro of Hadoop stating that they could ingest 500GB in half an hour and that was way beyond the capabilities of traditional RDBMS. Clearly they hadn't seen the Microsoft article on loading 1TB in 30 minutes.

One of the SQLCat labs at SQL Bits (Liverpool) actually went into the details of how this was done and it was remarkably straight forward.

Of course the Hadoop marketeer didn't give details of what that 500GB looked like. Was it a simple log file or a complex structured JSON document. we'll never know!

Then there is the irritating tendency to demonstrate the NOSQL products superiority by using an awful schema in an RDBMS to say that the RDBMS can't cope. That's like saying that if you make Usain Bolt wear diving boots it prooves he's a lousy sprinter.

LinkedIn Profile
www.simple-talk.com
Eric M Russell
Eric M Russell
SSC-Forever
SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)

Group: General Forum Members
Points: 43633 Visits: 12057
NoSQL databases are the digital equivalent of a basement level archive with pallets full of boxes more or less organized. They typically can't function as a primary database of record or even an operational data store. It's the transactional or meta-data layer that "rules" all the other disparate data stores. For example, on the Hadoop platform, HCatalog, the meta-data layer that maps files to logical tables names, must be a relational database like MySQL or SQL Server.


"The universe is complicated and for the most part beyond your control, but your life is only as complicated as you choose it to be."
Gary Varga
Gary Varga
SSC-Dedicated
SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)SSC-Dedicated (35K reputation)

Group: General Forum Members
Points: 35496 Visits: 6562
Maybe NoSQL databases are more like the tourism leaflet stands. Quick to access. No guaranteed order. Exceptionally good at that task i.e. it is no filing cabinet!!!

Gaz

-- Stop your grinnin' and drop your linen...they're everywhere!!!
Dave Poole
Dave Poole
SSC-Insane
SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)SSC-Insane (24K reputation)

Group: General Forum Members
Points: 24312 Visits: 3495
Unsupervised digital landfill

LinkedIn Profile
www.simple-talk.com
Eric M Russell
Eric M Russell
SSC-Forever
SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)SSC-Forever (43K reputation)

Group: General Forum Members
Points: 43633 Visits: 12057
David.Poole (4/24/2014)
Unsupervised digital landfill

Unfortunately there are many IT shops who treat their relational database like a digital landfill: no thought for normalization, constraints, or optimization. Archived emails, website clicks, basically anything consisting of 0s and 1s, gets dumped into the landfill that gets so large it needs it own zip code.

When the database starts crashing, they ask themselves:
"Do we need a bigger dump truck for all this?"


"The universe is complicated and for the most part beyond your control, but your life is only as complicated as you choose it to be."
Go


Permissions

You can't post new topics.
You can't post topic replies.
You can't post new polls.
You can't post replies to polls.
You can't edit your own topics.
You can't delete your own topics.
You can't edit other topics.
You can't delete other topics.
You can't edit your own posts.
You can't edit other posts.
You can't delete your own posts.
You can't delete other posts.
You can't post events.
You can't edit your own events.
You can't edit other events.
You can't delete your own events.
You can't delete other events.
You can't send private messages.
You can't send emails.
You can read topics.
You can't vote in polls.
You can't upload attachments.
You can download attachments.
You can't post HTML code.
You can't edit HTML code.
You can't post IFCode.
You can't post JavaScript.
You can post emoticons.
You can't post or upload images.

Select a forum

































































































































































SQLServerCentral


Search