Hi guys,
This was an exercise I was given several days ago. I could not find a good way to relate these two tables:
Views per 5 min interval
Programme Schedule
I would like to find the channel rankings (most popular overall and by channel), but the way the data is presented makes it hard to build a relationship between the two tables.
I used a very messy and inaccurate solution in Excel. I cleaned up the data using R (subsetting the programme schedule by channel to reduce the size of the dataset) and used an adjusted start time (rounded up to the nearest 5 minutes) as the basis of the relationship. I then used VLOOKUP to match each programme's adjusted start time to the views at that interval.
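To make the approach concrete, here is a minimal Python sketch of the rounding-and-lookup step. It is not the R/Excel I actually used, and the field names (`channel`, `programme`, `start`) and the sample values are made up for illustration.

```python
from datetime import datetime, timedelta


def round_up_to_5min(t: datetime) -> datetime:
    """Round a timestamp up to the next 5-minute boundary.

    A timestamp already on a boundary is returned unchanged.
    """
    floored = t.replace(minute=t.minute - t.minute % 5, second=0, microsecond=0)
    return floored if floored == t else floored + timedelta(minutes=5)


# Hypothetical views table: (channel, interval start) -> views in that interval.
views = {
    ("A", datetime(2020, 1, 1, 18, 0)): 100,
    ("A", datetime(2020, 1, 1, 18, 5)): 120,
    ("A", datetime(2020, 1, 1, 18, 10)): 90,
}

# Hypothetical programme schedule with exact (unrounded) start times.
schedule = [
    {"channel": "A", "programme": "News", "start": datetime(2020, 1, 1, 18, 2)},
]


def lookup_views(schedule, views):
    """Exact-match lookup on the adjusted start time, similar to a VLOOKUP."""
    matched = []
    for row in schedule:
        adjusted = round_up_to_5min(row["start"])
        matched.append({
            **row,
            "adjusted_start": adjusted,
            # None when no interval matches, like VLOOKUP returning #N/A.
            "views": views.get((row["channel"], adjusted)),
        })
    return matched


matched = lookup_views(schedule, views)
```

Each programme ends up with the views of exactly one interval, the one at its adjusted start time.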
It worked somewhat, but the views were not very accurate. Channels with fewer shows but longer showtimes were counted less often, while channels with numerous shows but shorter showtimes were counted more often. So I used average views per showing instead. However, total views are still inaccurate.
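A small Python sketch of the bias I mean, using invented numbers: two hypothetical channels with identical views in every interval, where channel A airs one 30-minute programme and channel B airs six 5-minute programmes. Matching on start time alone gives each showing a single interval.

```python
from collections import defaultdict
from datetime import datetime, timedelta

BASE = datetime(2020, 1, 1, 18, 0)
STEP = timedelta(minutes=5)

# Hypothetical views table: every 5-minute interval on both channels has 100 views.
views = {(ch, BASE + i * STEP): 100 for ch in ("A", "B") for i in range(6)}

# Channel A: one 30-minute programme. Channel B: six 5-minute programmes.
# Start times already sit on 5-minute boundaries, so no rounding is needed here.
schedule = [("A", "Film", BASE)] + [
    ("B", f"Short {i + 1}", BASE + i * STEP) for i in range(6)
]


def summarise(schedule, views):
    """Match each showing to the single interval at its start time, per channel."""
    matched = defaultdict(list)
    for channel, _programme, start in schedule:
        matched[channel].append(views[(channel, start)])
    return {
        ch: {
            "showings": len(v),
            "matched_total": sum(v),
            "avg_per_showing": sum(v) / len(v),
        }
        for ch, v in matched.items()
    }


summary = summarise(schedule, views)
```

With these made-up numbers, the matched total depends on how many showings a channel has rather than how long it was on air, while the average per showing comes out the same for both channels.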
I'd like to know how you would tackle this problem. I have been stumped on this one for some time.
Regards
Zac