Solving a machine learning problem can be a daunting affair for beginner data scientists. There are simply so many algorithms to choose from! Simply go to scikit-learn‘s page, and you are already overwhelmed by all the options! One of the main challenges is that if you get bad performance results, you can’t be sure whether it is your fault or the dataset is simply not good enough.
Through all the years of practice, I have developed a process that I am using to quickly figure out whether the data is of good quality or not.