Category Weekly Wednesday Lectures
Week 7: hunch.com, Recommendation Engines, SVD, Alternating Least Squares, Convexity, Filter Bubbles
Each week Cathy O’Neil blogs about the class. Cross-posted from mathbabe.org Last night in Rachel Schutt’s Columbia Data Science course we had Matt Gattis come and talk to us about recommendation engines. Matt graduated from MIT in CS, worked at SiteAdvisor, and co-founded hunch as its CTO, which recently got acquired by eBay. Here’s what […]
Week 6: Kaggle, crowdsourcing, decision trees, random forests, social networks, and Google’s hybrid research environment
Each week Cathy O’Neil blogs about the class. Cross-posted from mathbabe.org Yesterday we had two guest lecturers, who took up approximately half the time each. First we welcomed William Cukierski from Kaggle, a data science competition platform. Will went to Cornell for a B.A. in physics and to Rutgers to get his Ph.D. in biomedical […]
Week 5: GetGlue, time series, financial modeling, advanced regression, and ethics
Each week Cathy O’Neil blogs about the class. Cross-posted from mathbabe.org. But what makes this week unique is that Cathy was our guest lecturer. So first I need to introduce her, and then what follows is her blog post. Students in the class already know Cathy because she comes each week, asks good questions and […]
Week 4: The Data Science Process, k-means, Classifiers, Logistic Regression and Evaluation
Each week Cathy O’Neil blogs about the class. Cross-posted from mathbabe.org This week our guest lecturer for the Columbia Data Science class was Brian Dalessandro. Brian works at Media6Degrees as a VP of Data Science, and he’s super active in the research community. He’s also served as co-chair of the KDD competition. Before Brian started, […]
Week 3: Naive Bayes, Laplace Smoothing, APIs and Scraping data off the web
Cathy O’Neil blogs about the class each week. Crossposted from mathbabe.org In the third week of the Columbia Data Science course, our guest lecturer was Jake Hofman. Jake is at Microsoft Research after recently leaving Yahoo! Research. He got a Ph.D. in physics at Columbia and taught a fantastic course on modeling last semester at […]
Week 2: Simulated Chaos, RealDirect, linear regression, k-nearest neighbors
Cathy O’Neil blogs about the class each week. Crossposted from mathbabe.org Data Science Blog Today we started with discussing Rachel’s new blog, which is awesome and people should check it out for her words of data science wisdom. The topics she’s riffed on so far include: Why I proposed the course, EDA (exploratory data analysis), […]
Week 1: What is Data Science?
Cathy O’Neil will be blogging about the class after each lecture. Crossposted from mathbabe.org I’m attending Rachel Schutt’s Columbia University Data Science course on Wednesdays this semester and I’m planning to blog the class. Here’s what happened yesterday at the first meeting. Syllabus Rachel started by going through the syllabus. Here were her main points: […]