Click here to monitor SSC
SQLServerCentral is supported by Red Gate Software Ltd.
 
Log in  ::  Register  ::  Not logged in
 
 
 

Outlier Detection with SQL Server, part 3.5: The Modified Thompson Tau Test

By Steve Bolton

…………Based on what little experience I’ve gained from writing this series on finding outliers in SQL Server databases, I expected the Modified Thompson Tau test to be a clunker. It marries the math underpinning one of the most ubiquitous means of outlier detection, Z-Scores, with the… Read more

0 comments, 3,969 reads

Posted in Multidimensional Mayhem on 14 February 2015

Outlier Detection with SQL Server, part 3.4: Dixon’s Q-Test

By Steve Bolton

…………In the last three installments of this amateur series of mistutorials on finding outliers using SQL Server, we delved into a subset of standard detection methods taken from the realm of statistical hypothesis testing. These are generally more difficult to apply to tables of thousands of… Read more

0 comments, 265 reads

Posted in Multidimensional Mayhem on 30 January 2015

Outlier Detection with SQL Server, part 3.3: The Limitations of the Tietjen-Moore Test

By Steve Bolton

…………The Tietjen-Moore test may have the coolest-soundest name of any of the outlier detection methods I’ll be surveying haphazardly in this amateur series of mistutorials, yet it suffers from some debilitating limitations that may render it among the least useful for SQL Server DBAs. It is… Read more

0 comments, 146 reads

Posted in Multidimensional Mayhem on 20 January 2015

Outlier Detection with SQL Server, part 3.2: GESD

By Steve Bolton

…………In the last edition of this amateur series of self-tutorials on finding outlying values in SQL Server columns, I mentioned that Grubbs’ Test has a number of limitations that sharply constrain its usefulness to DBAs. The Generalized Extreme Studentized Deviate Test (GESD) suffers from some of… Read more

0 comments, 5,214 reads

Posted in Multidimensional Mayhem on 17 December 2014

Outlier Detection with SQL Server, part 3.1: Grubbs’ Test


By Steve Bolton

…………In the last two installments of this series of amateur self-tutorials, I mentioned that the various means of detecting outliers with SQL Server might best be explained as a function of the uses cases, the context determined by the questions one chooses to ask of the… Read more

2 comments, 5,932 reads

Posted in Multidimensional Mayhem on 29 November 2014

Outlier Detection with SQL Server, part 2.2: Modified Z-Scores

By Steve Bolton

…………There are apparently many subtle variations on Z-Scores, a ubiquitous measure that is practically a cornerstone in the foundation of statistics. The popularity and ease of implementation of Z-Scores are what made me decide to tackle them early on in this series of amateur self-tutorials, on… Read more

0 comments, 389 reads

Posted in Multidimensional Mayhem on 13 November 2014

Outlier Detection with SQL Server, part 2.1: Z-Scores

By Steve Bolton

…………Using SQL Server to ferret out those aberrant data points we call outliers may call for some complex T-SQL, Multidimensional Expressions (MDX) or Common Language Runtime (CLR) code. Yet thankfully, the logic and math that underpin the standard means of outlier detection I’ll delve into in… Read more

2 comments, 628 reads

Posted in Multidimensional Mayhem on 28 October 2014

Outlier Detection with SQL Server, part 1: Finding Fraud and Folly with Benford’s Law

By Steve Bolton

…………My last blog series, A Rickety Stairway to SQL Server Data Mining, often epitomized a quip by University of Connecticut statistician Daniel T. Larose, to the effect that “data mining is easy to do badly.”[1] It is clear that today’s sophisticated mining algorithms can still… Read more

2 comments, 1,245 reads

Posted in Multidimensional Mayhem on 19 September 2014

Stay Tuned…for a SQL Server Tutorial Series Juggling Act

by Steve Bolton

…………If all goes according to plan, my blog will return in a few weeks with two brand new series, Using Other Data Mining Tools with SQL Server and Information Measurement with SQL Server. Yes, I will be attempting what amounts to a circus act among SQL… Read more

0 comments, 213 reads

Posted in Multidimensional Mayhem on 1 July 2014

A Rickety Stairway to SQL Server Data Mining, Part 15, The Grand Finale: Custom Data Mining Viewers


By Steve Bolton

…………As mentioned previously in this amateur self-tutorial series on the most neglected component of Microsoft’s leading database server software, SQL Server Data Mining (SSDM) can be extended through many means, such as Analysis Services stored procedures, CLR functionality, custom mining functions and plug-in algorithms. I had… Read more

0 comments, 1,122 reads

Posted in Multidimensional Mayhem on 11 February 2014

A Rickety Stairway to SQL Server Data Mining, Part 14.8: PMML Hell


By Steve Bolton

…………In A Rickety Stairway to SQL Server Data Mining, Part 14.3: Debugging and Deployment, we passed the apex of this series of amateur self-tutorials on SQL Server Data Mining (SSDM) and have seen the difficulty level and real-world usefulness of the material decline on a… Read more

0 comments, 475 reads

Posted in Multidimensional Mayhem on 15 January 2014

A Rickety Stairway to SQL Server Data Mining, Part 14.7: Additional Plugin Functionality


By Steve Bolton

…………In order to divide this segment of my amateur tutorial series on SQL Server Data Mining (SSDM) into digestible chunks, I deferred discussion of some functionality that can be implemented in custom algorithms. In the last installment I explained how to write custom data mining functions,… Read more

0 comments, 988 reads

Posted in Multidimensional Mayhem on 31 December 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.6: Custom Mining Functions


by Steve Bolton

…………In the last installment of this amateur series of mistutorials on SQL Server Data Mining (SSDM), I explained how to implement the Predict method, which controls how the various out-of-the-box prediction functions included with SSDM are populated in custom algorithms. This limits users to returning the…

Read more

0 comments, 1,396 reads

Posted in Multidimensional Mayhem on 28 November 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.5: The Predict Method

By Steve Bolton

…………In order to divide the Herculean task of describing custom algorithms into bite-sized chunks, I omitted discussion of some plug-in functionality from previous installments of this series of amateur tutorials on SQL Server Data Mining (SSDM). This week’s installment ought to be easily digestible, since it… Read more

2 comments, 1,573 reads

Posted in Multidimensional Mayhem on 30 October 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.4: Node Navigation


By Steve Bolton

…………The pinnacle of difficulty in this series of amateur self-tutorials on SQL Server Data Mining (SSDM) was surmounted in the last installment, when we addressed the debugging and deployment of custom plug-in algorithms. From here on in, we will be descending back down the stairway, at… Read more

0 comments, 1,260 reads

Posted in Multidimensional Mayhem on 15 October 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.3: Debugging and Deployment


By Steve Bolton

…………Throughout this series of amateur self-tutorials in SQL Server Data Mining (SSDM), I’ve often said that working with Analysis Services is a bit like blasting off into space with the Starship Enterprise, because you may be boldly going where no man has gone before. My An… Read more

0 comments, 673 reads

Posted in Multidimensional Mayhem on 16 September 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.2: Writing a Bare Bones Plugin Algorithm

By Steve Bolton

…………As I’ve said many times throughout this series of amateur mistutorials, SQL Server Data Mining (SSDM) is one of the most powerful yet neglected products Microsoft has to offer. The ability to extend it with your own algorithms is in turn the most powerful yet neglected… Read more

0 comments, 1,596 reads

Posted in Multidimensional Mayhem on 26 August 2013

At the Top of the Rickety Stairway…

by Steve Bolton

…………As expected, I’ve run into quite a few obstacles while trying to deploy and debug my first plug-in algorithm. I’ve finally emerged from “DLL Hell” and have successfully deployed the first algorithm I’ll be using in this tutorial, but still have fatal errors lurking somewhere in… Read more

0 comments, 323 reads

Posted in Multidimensional Mayhem on 31 July 2013

A Rickety Stairway to SQL Server Data Mining, Part 14.1: An Introduction to Plug-In Algorithms


by Steve Bolton

…………In my last post in this amateur series of self-tutorials on SQL Server Data Mining (SSDM), I got into a lengthy discussion of how neglected but powerful SQL Server Analysis Services (SSAS) stored procedures are. This is part of a larger pattern of under-utilization of some… Read more

0 comments, 1,791 reads

Posted in Multidimensional Mayhem on 25 June 2013

A Rickety Stairway to SQL Server Data Mining, Part 13: Using CLR with Analysis Services


by Steve Bolton

               I was drawn into the realm of SQL Server in a roundabout manner thanks to Visual Basic. Around the time I got my Microsoft Certified Solution Developer (MCSD) certification in VB 6.0 (at precisely the same time .Net hit the market, instantly leaving me a year or… Read more

0 comments, 1,952 reads

Posted in Multidimensional Mayhem on 23 May 2013

Older posts