Latest Posts
Improving Your Terminal Experience with Zsh As a data scientist you better know your way around Bash. While you’ll want to do serious modeling in Python or R, getting comfortable at the command line will only speed up your work flows An Introduction to Interactive Visualizations with Bokeh Some highlights from my ODSC presentation on interactive visualizations. Predicting NHL success from the Canadian Hockey Leagues (CHL) I examined players from the Canadian Hockey League and built a classifier to predict whether or not a player would make it to the NHL and record at the minimum a 20 point season. Principal Component Analysis & Clustering with Airport Delay Data Taking a look at airport datasets to determine root characteristics that lead to airport delays. Predicting IMDB Ratings In this project we pulled the top 250 movies from IMDB, and scraped their reviews from moviegoers. We used word counts to build a random forests regressor to predict the IMDB score given a test set of data.