Category Data Wrangling

The Data Science Process

Dear Students, Now that we’ve had our first guest lecture, I’d like to revisit the general framework I proposed for thinking about the data science process on the first day of class (when I generalized the example from Google Plus), and show how Jake’s lecture fits within this framework. Throughout the semester we’ll see that […]

Week 3: Naive Bayes, Laplace Smoothing, APIs and Scraping data off the web

Cathy O’Neil blogs about the class each week. Crossposted from mathbabe.org In the third week of the Columbia Data Science course, our guest lecturer was Jake Hofman. Jake is at Microsoft Research after recently leaving Yahoo! Research. He got a Ph.D. in physics at Columbia and taught a fantastic course on modeling last semester at […]

Follow

Get every new post delivered to your Inbox.

Join 435 other followers

Build a website with WordPress.com