Discussion of the mathematics of model overfitting, how it happens, the problem it creates, and a mitigation strategy.
I’m a computer science student pursuing an analytics career. Learn more about me here.
I began studying analytics in late 2017. Since then, I've become convinced of two things:
- The rate at which society generates data is increasing, as is demand for people with analytics skills.
- This work is complex and multifaceted. New techniques, tools, and methodologies are developed every day.
If you'd like to connect, there are several ways of reaching out to me near my avatar photo (top left).
Multiple linear regression using the Excel Linest function.
An overview of the Python Scientific Stack, courtesy of a Quora post written by Yassine Alouini and Jake VanderPlas.
Comparing the Information Gain of Various Data and Models.
Histograms and descriptions of some common Probability Density Functions.