A Step-By-Step approach to analyze Data problems (Series 101 - Part 1)
Hello ! In this post, we shall see the how to take a systematic approach towards solving data problems. The approach has been intended to be as generic as possible so as to envelope the entire super-set of Machine Learning problems. Each of these steps will be covered in depth in the upcoming posts. Without much ado, let us grab a cup of coffee and get into the details. Let us consider a real-world problem. The Problem Let's say that we wish to predict whether a certain song (new song) will be liked by an individual or not. Data Collection The first step is to identify the source of data which has relevant information, and it should be one which is sufficiently large enough for us to be able to do fair analysis. The amount of data which could be termed as "sufficient" could vary as per the use-case and the nature of the data itself. For our use-case, we consider data from Spotify. (I would be explaining in a later post as to how to retrieve data from Spotify ...