If you are considering a career in Business Analytics and are a database query geek, there are many free sources of data on the Internet to play with. Here are 10 examples from the mundane – grain and construction — to the entertaining– baseball and hip-hop lyrics.
- U.S. Export Balances and Cumulative Exports (US Gov’t.)
- Annual Retail Trade Survey (US Gov’t.)
- Occupational Employment Statistics – Employment and Wages (US Gov’t.)
- 2000 US Census (Amazon)
- 1000 Genomes: A catalog of human genetic variation (Amazon)
- Wikipedia Page Traffic Statistic V3 : Seven months of hourly page traffic statistics for over 2.5 Million Wikipedia articles with the associated content and metadata. (Amazon)
- 2003-2006 US Economic Data (Amazon)
- Bureau of Labor Statistics (Google)
- Pitch-by-pitch data for Major League Baseball (MLB) games in 2016 (Google)
- Every mention of the 2016 primary candidates in hip-hop songs (FiveThirtyEight data made available by GitHub)
Mentions of Donald Trump in the hip-hop data trove were 160 positive (mentions of his wealth dominate), 74 neutral, and 34 negative.
If you want to hunt for your own data to play with, GGU Adjunct Professor Rao Mikkilineni suggests this list of 25 websites for data science projects which contain the sources above. Mikkilineni currently teaches courses in GGU’s master’s degree in Business Analytics program.