An exercise in web scraping, data cleaning, and visualization that identifies where most data jobs in California are located. Tools utilized include Python, Pandas, Tableau, regular expressions, and BeautifulSoup (5/18).
An exploration and visualization of data on the rise of the service economy, with commentary. Utilizes Python and Pandas, and leverages Matplotlib extensively (2/18).
Gathering, assessing, and cleaning data related to the tweet history of the famous WeRateDogs Twitter account. Utilizes Python, Pandas, and Matplotlib (4/18).
An exploratory analysis of a dataset containing physicochemical attributes and quality ratings for a selection of 1600 red wines. Utilizes R, ggplot2, and ggpairs (4/18).
A brief exercise in inferential statistics demonstrating a statistically significant result that establishes the presence of a well-known psychological phenomenon. Tools utilized include Python, Pandas, and Matplotlib (2/18).
A project that combines inferential statistics and hypothesis testing with a database of user conversion rates to determine whether a company should adopt a new web page or keep the old one. Utilizes Python, Pandas, Matplotlib, and logistic regression via the statsmodels.api module (2/18).