SQL Server Central
The Voice of the DBA

What Happened to Hadoop?

Six or seven years back, Hadoop was the big thing. It was going to solve our big data analytics needs and provide cheap storage and query power on commodity servers. More and more companies were going to be using it. Microsoft invested in HDInsight, and SQL Server got PolyBase to query data directly from HDFS. I was seeing the Hadoop elephant everywhere. I still remember popping into a few sessions at SQLBits to try and learn a bit more about how Hadoop worked.

In the last couple of years, Hadoop has somewhat dropped off the radar as "the thing" that most companies need to become data-driven and deal with large amounts of unstructured data. I found this analysis that talks about why Hadoop hasn't taken over the world.

The short answer? Real-time needs, cloud computing, and containers. I think, though, that the real problem was Hadoop's complexity. It was too hard for most companies to deal with, and too few were willing to invest in the large infrastructure and the skills required to manage such a system. I'd say the same thing about Kubernetes, but it's evolving rapidly to become easier, and it's dirt simple in the cloud. I suspect we'll see more Kubernetes cloud deployments than on-premises ones.

The other issue is that Hadoop runs batch jobs, which isn't what many organizations want. They already deal with, and complain about, plenty of relational batch jobs, whether this is ETL to a warehouse, cube processing, or some other delay. They want queries on data in place, which is becoming more commonplace all the time.

Of course, one other important point from the piece is something I believe. The relational database, or data warehouse, is not going away. It's still important to many organizations, and it's useful to handle lots of reporting. With the growth of the SQL Server platform, you might even do more AI/ML analysis on your data in place, without the need to move it to an HDFS platform.

Steve Jones - SSC Editor

Join the debate, and respond to today's editorial on the forums

 Featured Contents

Introduction to STRING_SPLIT function in SQL

Archana from SQLServerCentral

This article explains the T-SQL function STRING_SPLIT() and demonstrates a creative use for it.

Get Your Scalar UDFs to Run Faster Without Code Changes

Additional Articles from SimpleTalk

Microsoft has added a group of features called Intelligent Query Processing to SQL Server 2017 and 2019. In this article, Greg Larsen explains one of the features, Scalar UDF Inlining.

New release: Mask SQL Server data 3x faster with Data Masker 7

Additional Articles from Redgate

In the latest version of Data Masker for SQL Server, it’s now much easier to rapidly identify sensitive data that needs masking, set up appropriate rules, see which masking rules are applied to which column, and locate what sensitive data you've yet to mask.
We’ve also improved the performance of commonly used masking rules, which lets you run masking operations 3 times faster.

From the SQL Server Central Blogs - SUMX returns incorrect results with duplicates

Koen Verbeeck from Koen Verbeeck

Okay, the title of this blog post could also have been “SUMX returns unexpected results with duplicates”. The results only seem incorrect because an incorrect assumption might have been...

From the SQL Server Central Blogs - Set a variable and update a column at the same time.

Kenneth.Fisher from SQLStudies

The other day I saw a weird option for the UPDATE command in a Stack Overflow answer. Basically it looked ...


 Question of the Day

Today's question (by Steve Jones - SSC Editor):


Casting Tiny Numbers

What happens when I run this code?
DECLARE @val_tinyint tinyint = 100;
PRINT(CAST(@val_tinyint AS VARCHAR(2)));

Think you know the answer? Click here, and find out if you are right.



 Yesterday's Question of the Day (by Steve Jones - SSC Editor)

Returning Info from Stored Procedures

How many ways can I return info from stored procedures?

Answer: 3

Explanation: I can return information from a stored procedure in these ways:

  • Result sets
  • Output Parameters
  • Return values

Ref: Return Data from a Stored Procedure
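
The three mechanisms listed in the explanation can be seen together in one small procedure. This is only a sketch: the table dbo.DemoOrders, the procedure dbo.GetOrderSummary, and the sample data are hypothetical names invented for illustration and do not come from the original question.

```sql
-- Hypothetical demo table and data (assumptions, for illustration only).
CREATE TABLE dbo.DemoOrders (OrderID int, CustomerID int, OrderDate date);
INSERT INTO dbo.DemoOrders (OrderID, CustomerID, OrderDate)
VALUES (1, 42, '2019-10-01'), (2, 42, '2019-10-15'), (3, 7, '2019-10-20');
GO

-- Hypothetical procedure that uses all three ways of returning information.
CREATE PROCEDURE dbo.GetOrderSummary
    @CustomerID int,
    @OrderCount int OUTPUT
AS
BEGIN
    SET NOCOUNT ON;

    -- 1. Result set: the rows of this SELECT are sent back to the caller.
    SELECT OrderID, OrderDate
    FROM dbo.DemoOrders
    WHERE CustomerID = @CustomerID;

    -- 2. Output parameter: the caller reads this value after EXEC completes.
    SELECT @OrderCount = COUNT(*)
    FROM dbo.DemoOrders
    WHERE CustomerID = @CustomerID;

    -- 3. Return value: a single integer, used here as a status code.
    IF @OrderCount = 0
        RETURN 1;   -- no orders found for this customer
    RETURN 0;       -- at least one order found
END;
GO

-- Calling the procedure and capturing the output parameter and return value.
DECLARE @count int, @status int;
EXEC @status = dbo.GetOrderSummary
    @CustomerID = 42,
    @OrderCount = @count OUTPUT;
SELECT @status AS ReturnStatus, @count AS OrderCount;
```

The return value is limited to a single integer, so it is used here only as a status code, while the actual data travels through the result set and the output parameter.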

Discuss this question and answer on the forums




Database Pros Who Need Your Help

Here are a few of the new posts today on the forums. To see more, visit the forums.

SQL Server 2017 - Development
Taking a single 80 character piece of Text and converting to columns in a table. - Hi,   I have a 80 character piece of text that I wish to convert to a table structure that represents that data. Is there a simple way to take those 80 characters and basically insert them into the table structure without doing multiple substring commands for the individual columns.   Thanks in advance Andrew
DDL triger to grant select permission for new table/view - Dears,   Plz help with this topic- I need to grant permission for new table/views created in one schema when this are just created Thanks in advance, Hadrian
Resultset in a format - I have the following tables: create table #EmployeeSystemsMaster(EmployeeSystemID int, EmployeeSystemName nvarchar(150)) insert into #EmployeeSystemsMaster select 1, 'Core Banking' union select 2, 'Treasury' union select 3, 'Retail Banking' union select 4, 'Corporate Banking' union select 5, 'Anti Money Laundering' union select 6, 'Trade Finance' select * From #EmployeeSystemsMaster create table #NationalityMaster(NationalityID int, NationalityName nvarchar(150)) insert into […]
SQL Server 2016 - Administration
How to make DBCC CheckDB faster ? - Hi All We are running DBCC CheckDB in our server during low activity time. However it takes 2 until 3 hours to complete and CPU usage is 95 - 100 % during that time I want to make the process faster and reduce CPU usage if possible FYI : CPU = 2 - Development Server […]
SQL Server 2016 - Development and T-SQL
Creation of clustered primary key only utilizes 1 CPU core - Hi all, I am once again facing an interesting issue with multi-threading in SQL Server 2016. But first of all, this is the setup: we are using the Enterprise Edition (SP2) on a 24 core single CPU socket server with 5 PCIe NVME SSDs (2 TB each). Filegroups are distributed on all 5 indpendent SSDs. […]
Administration - SQL Server 2014
REPLICATION, Dropping Subscription Taking A Long Time - USE Published_DB; EXEC sp_dropsubscription @publication = N'Published_DB_Publication', @subscriber = N'all', @destination_db = N'Published_DB', @article = N'all'; I have a fairly large DB. It is replicating approx 7000 articles. When I run the script above, it's taking a long time (now 1 Hr+). Any ideas what I need to check here? Adding, I have all REPL […]
SQL 2012 - General
List of Dataabases from all the servers - Hi, We have 100+ servers and each servers contains few databases and i would like to get the all database list by each server/instance. Server1 DB1 DB2 DB3 Server2 DB1 DB2 DB3 DB4
SQL Server data masking - Can you able to mask data in sql server 2012? If yes, would it be easy to mask few tables or the complete database?  
Table self-join for derived columns - Hi, I have a requirement to derive additional columns and populate the values by doing a self-join. I have tried to use self-join and cross apply but not getting the desired results. I have attached the desired result format and also included the DDL and sample data. Could somebody please help in this regard. Thanks. […]
SQL Server 2019 - Administration
Question upgrading - Hi, I see the post of going from 2012 - 2019 but we were wondering if we should go to 2017 or 2019. Currently, we only have about 20 users and do not do anything with warehousing; of course, both could change down the road. Someone said that they found it to be much less […]
SQL Server 2019 - Development
CREATE LOGIN FROM Windows (AD account): Login failed for user - Hi, I have made with Server 2019 an VirtualBox test environment on my Workstation. I have : Active Directory Controller SQL Principal SQL Failover Web 01 Web 02 I develop an automate application that do this : Installation : Create Active Directory Windows Account Create Database on Principal with Failover Install Web Application on Web […]
Constraint on multiple columns - Hi, I want to restrict users from adding duplicate rows. Is there a way to put one constraint on multiple columns, or use a trigger to do this, maybe BEFORE insert/ BEFORE update? Thanks S
How to use next record's date as End date of current record? - I have a select statement which its result is something like this: I would like to have an end date column which is the next start date. I would appreciate it if someone could guide me.    
SMO Certificate Export - FailedOperationException - Greetings, Given this Powershell SMO method call... $_cert.Export("$CertificateLocation\$_cert_name.cer", ` "$PrivateKeyLocation\$_cert_name.pvk", ` $_secret) where $_cert is a Microsoft.SqlServer.Management.Smo.Certificate object and the arguments are plain strings (not secure strings). When I execute the code it returns a FailedOperationError.  I can't figure out why it is failing.  I have tested all the paths in the script to ensure […]
Analysis Services
Sharepoint 2019 and power pivot - We are migrating from Sharepoint 2013 on-prem to Sharepoint 2019 on-prem. In SP2013 we had Excel power pivot files (connection to SSAS multidimensional cube). A user used to open the file in SP, used it, also could save modified file and everything worked ok. From Sharepoint Server 2019 it seems that support for Power Pivot […]


©2019 Redgate Software Ltd, Newnham House, Cambridge Business Park, Cambridge, CB4 0WZ, United Kingdom. All rights reserved.

