June 2, 2010 at 10:25 am
I am building a project to study MS BI, and would like some input.
My goal is to create an data mining model which assigns the job post to a cluster based on type of job.
What follows is what I have done so far.
I would appreciate any input you have, from descriptive style to process logic as I am working without significant DM experience, Relational DB management experience, or a mentor of any kind.
Steps to this point...
Build SSIS packages which
The data looks like this
Create Structure based on
Create mining model based on
max_input_attributes = zero [/li]
default for other parameters[/li]
Found a bunch of terms that were obviously not related to the job type so
Found a bunch of terms with the same meaning so
SO now the top x terms for each cluster are beginning to show similaraties
but there are lot of medical / nursing jobs in the database that do not appear in a cluster..
I will try increasing the number of clusters, and report back.
?????
Should my dictionary be much much smaller?
Sould I pick examples from each category and search for similar posts (how would that be done)
Here's hoping for a lively discussion!
Rob
Viewing post 1 (of 1 total)
You must be logged in to reply to this topic. Login to reply