Mala's Data Blog

My name is Malathi, a.k.a. Mala - I am a DBA turned BI/Data Science person, working with SQL Server since 6.5. I am also founder of the Louisville SQL Server user group, organizer of 8 SQL Saturdays, Regional Mentor for the Northeast, and a 12-year PASS conference attendee. In my spare time I love to garden, travel, read, paint, and do yoga.

Basics of Probability

In this post I am going to introduce some of the basic principles of probability – and use them in other posts going forward. Quite a number of people would have learned these things in high school math and then forgotten them – I personally needed a refresher. These concepts are… Read more

0 comments, 1,071 reads

Posted in Mala's Data Blog on 20 March 2017

TSQL2sday – Daily database WTF

This month’s TSQL Tuesday is organized by Kennie T Pontoppidan – the topic is ‘Daily Database WTF’ – or a horror story from the database world. As someone who has worked with databases for nearly two decades, I have several of these – I picked one of… Read more

0 comments, 109 reads

Posted in Mala's Data Blog on 12 March 2017

Generating Frequency Table

This week’s blog post is rather simple. One of the main characteristics of a data set involving classes, or discrete variables, is its frequencies. The number of times each data element or class is observed is called its frequency. A table that displays the discrete variable and number of times… Read more

0 comments, 1,513 reads

Posted in Mala's Data Blog on 6 March 2017

The Empirical Rule

I am resuming technical blogging after a gap of nearly a month. I will continue to blog my relearning of statistics and basic concepts, and illustrate them to the best of my ability using R and T-SQL where appropriate.

For this week I have chosen a statistical concept called… Read more

0 comments, 1,370 reads

Posted in Mala's Data Blog on 27 February 2017

SQL Cruise 2017 – Western Caribbean – my experience

As some readers may know, I have been a regular attendee on SQL Cruises for 8 years now. SQL Cruise is a training (and vacation, for some) event organized by Tim Ford and Amy Ford that happens twice a year. I went on the first one… Read more

0 comments, 244 reads

Posted in Mala's Data Blog on 19 February 2017

My Epic Life Quest

I have always maintained a private bucket list. I have not had the courage to actually put it down in writing – but this year I decided that it is time. My good friend Brent Ozar has been doing this for a few years now, and his list is my… Read more

0 comments, 1,501 reads

Posted in Mala's Data Blog on 2 January 2017

2016 – A Year to remember

2016 has undoubtedly been a landmark year in my life. To me it marked my first conscious entry into middle age. It was the first year that I really pondered some of the questions that people need to think of as they get older in life – with clarity that… Read more

0 comments, 256 reads

Posted in Mala's Data Blog on 31 December 2016

Multivariate Variable Analysis using R

So far I’ve worked on simple analytical techniques using one or two variables in a dataset. This article is a sort of summary – about various techniques we can use for such datasets, depending on the type of variable in question. The techniques include – how to get summary… Read more

0 comments, 344 reads

Posted in Mala's Data Blog on 5 December 2016

Associative Analytics: Two sample T Test

In the previous post we looked at a one-sample T-Test. A one-sample T-Test helped us determine if a selected sample was indeed truly representative of the larger population. A two-sample T-Test goes a step further – it helps us determine if both samples came from… Read more

0 comments, 1,067 reads

Posted in Mala's Data Blog on 21 November 2016

PASSion Award and what it means to me

2016 is going to be a special year in my life. There was an article on the Oscar awards a while ago – on reasons why the Oscars are the most watched awards ceremony around the world. No, it is not just because of movie stars. Everyone, secretly or publicly –… Read more

9 comments, 669 reads

Posted in Mala's Data Blog on 31 October 2016

Days 1, 2 and 3 of PASS Summit 2016

Today is Thursday, October 27th already. For some of us the summit begins Monday – with precons and PASS volunteering-related meetings on Tuesday. For most other attendees the first day was Wednesday.

I arrived in the afternoon on Sunday with six other friends from Louisville, including my good friend Chris… Read more

0 comments, 197 reads

Posted in Mala's Data Blog on 27 October 2016

Sending Trevor love…

As some of you may be aware – fellow SQL family member, PASS Director, SQL Server MVP, founder of SQL Cruise and a good friend to many of us – Tim Ford – has a young son Trevor Ford who was recently diagnosed with a rare allergic drug reaction called… Read more

0 comments, 209 reads

Posted in Mala's Data Blog on 26 October 2016

TSQL Tuesday #83 – The Stats update solution

TSQL Tuesday is a monthly blog party hosted by a different blogger every month – it was started by Adam Machanic. This month’s TSQL Tuesday is hosted by Andy Mallon – the topic is ‘We’re dealing with the same problem’. I have chosen to write about a common problem… Read more

0 comments, 179 reads

Posted in Mala's Data Blog on 11 October 2016

11 years of PASS Summit

This is a story of my 11-year association with PASS, and the many ways it helped me grow as a person and in my career. And the many ways I saw other people grow.

Summit #1 – 2006: Gaylord, TX: I was a visa-holding DBA-developer at a small shop.… Read more

0 comments, 227 reads

Posted in Mala's Data Blog on 23 September 2016

Why SQL Cruise?

I was riding the elevator up from lunch today, at work. I am relatively new at my job and do not know several people at my workplace – yet. I live in a small town, and quite a lot of them know me as someone active in the local community.… Read more

2 comments, 275 reads

Posted in Mala's Data Blog on 23 September 2016

Associative Statistics – One sample T-Test with TSQL and R

In this post I am going to attempt to explore a statistical procedure called ‘One Sample T Test’.

A T-Test is used to test the mean value of a data sample against a known mean of the entire population from which the sample came. An example would be, if the… Read more

1 comments, 1,296 reads

Posted in Mala's Data Blog on 21 September 2016

Budgeting for SQL Saturdays

We’ve been doing SQLSaturdays for eight years at Louisville now. We’ve had quite a wide range of budgets depending on the year – our highest was in 2013 and our lowest was in 2009, the first year we started.

As main organizer, handling the dollars and making sure I spend… Read more

2 comments, 388 reads

Posted in Mala's Data Blog on 13 September 2016

Statistics with TSQL and R: Chi Square Test

As I move on from descriptive and largely univariate (one variable based) analysis of data into more multivariate data – one of the first data analysis tests that came to mind is the Chi Square Test. It is a very commonly used test to understand relationships between two variables that… Read more

6 comments, 1,570 reads

Posted in Mala's Data Blog on 13 September 2016

Statistics with T-SQL and R – the Pearson’s Correlation Coefficient

In this post I will attempt to explore calculation of a very basic statistic based on the linear relationship between two variables. That is, a number that tells you if two numeric variables in a dataset are possibly correlated and if yes, to what degree. The Pearson’s coefficient is a number… Read more

2 comments, 2,114 reads

Posted in Mala's Data Blog on 5 September 2016

Script for creating test data for odds ratio

Make sure you have a working version of SQL Server 2016.

USE [master]
GO

/****** Object: Database [WorldHealth] ******/
CREATE DATABASE [WorldHealth]
CONTAINMENT = NONE
ON PRIMARY
( NAME = N'WorldHealth', FILENAME = N'D:\Microsoft SQL Server\DATA\WorldHealth.mdf' , SIZE = 8192KB , MAXSIZE = UNLIMITED, FILEGROWTH = 65536KB )
LOG ON Read more

0 comments, 223 reads

Posted in Mala's Data Blog on 16 August 2016
