
The problem with personalization

The challenge of personalizing the news is the heterogeneity of our personal interests and the weakness of the signal we expose to the recommendation engine.

As an example, below are four stories from around the country that I was interested in today, for reasons ranging from ‘possibly obvious’ to ‘unknowable’ by an algorithm. By ‘unknowable’ I mean: how likely would a machine be to affirmatively pick any of these four, given such a low signal-to-noise ratio?

The dot-Boston domain is now open
Why am I interested? I was working at the Globe when we originally bid on and won the rights to sell this TLD. (I proposed renaming Boston.com to com.Boston. Just because.)
Could a machine have affirmatively predicted my interest? No.

Micro-apartments proposed for former mill building on Saco Island
Why am I interested? We have relatives who live in the complex and I spent some time working in the area.
Could a machine have affirmatively predicted my interest? Not likely.

NBC moves 130 Premier League games to streaming service
Why am I interested? I am a big fan of the Premier League and watch games on NBC – but found this by random chance at our paper in Sacramento.
Could a machine have affirmatively predicted my interest? Yes, I probably leave a wide paper trail on this topic.

Heavy traffic, cellphone service disruption expected in Charleston for total solar eclipse
Why am I interested? We will be in town for the eclipse.
Could a machine have affirmatively predicted my interest? If the algorithm knew my calendar, correlated the travel with the geographically specific eclipse event and put the two together – yes. In the current reality, no.

So what’s the ‘problem’ here? Personalization depends mostly on observed web behaviors, but much of our interest in the news is based on real-life experiences and events. So to provide me with a list of recommended stories, you need to know not what I clicked on last week, but where I lived in 1998 and my level of interest in urban planning issues.

So even though Facebook ostensibly knows everything about me, and Twitter is packed with people I know, trust and rely on for news – neither of those platforms, nor any other app I am aware of, is going to identify those four stories on a given day and surface them in a unified newsfeed.

In fact, I will be suitably impressed if a machine is ever able to perform at that level of serendipity. But if you invent it, I would pay for that convenience as a service.