Click here to monitor SSC
SQLServerCentral is supported by Red Gate Software Ltd.
 
Log in  ::  Register  ::  Not logged in
 
 
 
        
Home       Members    Calendar    Who's On


Add to briefcase

Things you wish you would have found about Data Mining Expand / Collapse
Author
Message
Posted Friday, February 17, 2012 9:14 AM
SSC Rookie

SSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC Rookie

Group: General Forum Members
Last Login: Friday, March 16, 2012 7:44 AM
Points: 36, Visits: 112
I basically want this thread to be a good way for the newbs like me to get up to speed with some of the knowledge the veterans have.

For example did you guys find books or web posts that when you read them helped you understand data mining better than before? If so post them up. I am trying to get myself up to speed on Data mining because i think the business is going to want it in the future. Any help in that process from you guys would be awesome and i thank you ahead of time.

Chris


Never stop learning or you will be left behind.
Post #1253974
Posted Friday, February 17, 2012 9:25 AM


SSChampion

SSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampionSSChampion

Group: General Forum Members
Last Login: 2 days ago @ 9:58 AM
Points: 13,872, Visits: 9,600
I think the key thing to know about any data mining is the differences between causation, correlation, and coincidence. It doesn't matter how good your technical skills are on the subject, if you can't spot those.

After that, learn how to judge data quality. There are ten or twelve major issues you'll find in data quality, regardless of the tools you use or the techniques you use them with, that will cause data mining to fail or produce false results if you don't know them thoroughly. You have to be able to spot the classical patterns like dropped out time, contrary facts, et al, without hesitation. Converse for the positive data quality metrics. You need to know those just as well.

After that, it's just all about the tools. Those will vary, and in a shop that's just moving into the field you'll probably be able to define what you want instead of having to learn legacy tools. That puts you in the driver's seat on that point.

But no tool available can make up for mistaking coincidence for cause or missing that a datum is from the wrong time period to be applicable, for example.


- Gus "GSquared", RSVP, OODA, MAP, NMVP, FAQ, SAT, SQL, DNA, RNA, UOI, IOU, AM, PM, AD, BC, BCE, USA, UN, CF, ROFL, LOL, ETC
Property of The Thread

"Nobody knows the age of the human race, but everyone agrees it's old enough to know better." - Anon
Post #1253979
Posted Friday, February 17, 2012 9:34 AM
SSC Rookie

SSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC RookieSSC Rookie

Group: General Forum Members
Last Login: Friday, March 16, 2012 7:44 AM
Points: 36, Visits: 112
Awesome GS thank you for the post very good info.

Never stop learning or you will be left behind.
Post #1253984
« Prev Topic | Next Topic »

Add to briefcase

Permissions Expand / Collapse