I need a large sample dataset, e.g. a 1-3GB CSV. I usually use data.gov for this kind of thing but, for the project I am working on it the data needs to be completely non-controversial. E.g. I can't download Crime Statistics, Consumer complaints... It seems like all the huge CSV files on data.gov are things of this nature. For reasons I can't go into - I can't use adventureworksDW or anything from codeplex.
I thought this would be simple but it's turning out to be a pain in the rear. I am having surprisingly bad luck when googling "Huge CSV sample data download" and stuff like that.
"I cant stress enough the importance of switching from a sequential files mindset to set-based thinking. After you make the switch, you can spend your time tuning and optimizing your queries instead of maintaining lengthy, poor-performing code."
-- Itzik Ben-Gan 2001