Challenges in building recommendation systems

Jul 10, 2018

A recommendation engine can be a gem, but the same engine becomes a nightmare if the people using the system can easily fool or manipulate it.

Here are a few basic situations and their solutions to prevent this from happening. If you are new to recommendation systems, you should definitely check this post first and then continue with the rest of the article.

Collaborative filtering is the technique of recommending products to users with the help of other, similar users. The core idea is that people with similar preferences will like similar types of products. Collaborative filtering is a general term, and many algorithms use this concept to recommend products. Latent collaborative filtering is one of the most widely used of these algorithms; it performs matrix factorization to recommend the most relevant products to users. There are also deep learning approaches to collaborative filtering that can outperform most traditional techniques. While implementing these algorithms in an application, we run into a few problems, which are described below:
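To make the core idea concrete, here is a minimal sketch of user-based collaborative filtering in plain Python. The ratings, user names, and product names are made up for illustration, and cosine similarity with a weighted average is only one of several reasonable choices, not the only way to do it.

```python
from math import sqrt

# Hypothetical toy data: user -> {product: rating on a 1-5 scale}
ratings = {
    "alice": {"book_a": 5, "book_b": 4, "book_c": 1},
    "bob":   {"book_a": 5, "book_b": 5, "book_d": 4},
    "carol": {"book_a": 1, "book_c": 5, "book_d": 2},
}

def cosine_similarity(u, v):
    """Cosine similarity computed over the products both users rated."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[p] * v[p] for p in common)
    norm_u = sqrt(sum(u[p] ** 2 for p in common))
    norm_v = sqrt(sum(v[p] ** 2 for p in common))
    return dot / (norm_u * norm_v)

def predict(user, product):
    """Similarity-weighted average of other users' ratings for the product."""
    num = den = 0.0
    for other, other_ratings in ratings.items():
        if other == user or product not in other_ratings:
            continue
        sim = cosine_similarity(ratings[user], other_ratings)
        num += sim * other_ratings[product]
        den += sim
    # None means nobody comparable has rated the product yet
    return num / den if den else None

print(predict("alice", "book_d"))
```

In this toy data alice agrees much more with bob than with carol, so her predicted rating for book_d lands closer to bob's 4 than to carol's 2.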

Cold Start Problem:

How do you deal with new users and products that don’t have any history?

Solution:

Use a content-boosted filtering approach. It is a combination of content-based filtering and collaborative filtering. You can use product descriptions and attributes, as well as user demographics, to recommend products to users.
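A rough sketch of the fallback idea, for a new product with no ratings: score it from the attributes it shares with products the user has already rated, and only use the collaborative prediction when one exists. This is a simplified hybrid rather than a full content-boosted algorithm, and the catalogue, the attribute sets, and the `hybrid_score` helper are all hypothetical. A new user could be handled the same way by comparing demographic attributes instead.

```python
# Hypothetical catalogue: product -> set of descriptive attributes
attributes = {
    "book_a": {"fantasy", "paperback", "english"},
    "book_b": {"fantasy", "hardcover", "english"},
    "book_c": {"cooking", "paperback", "english"},
    "book_new": {"fantasy", "paperback", "english"},  # no ratings yet
}

# Ratings one user has given so far (hypothetical)
user_ratings = {"book_a": 5, "book_b": 4, "book_c": 2}

def jaccard(a, b):
    """Overlap between two attribute sets, from 0.0 to 1.0."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def content_score(product, user_ratings):
    """Estimate a rating from the user's ratings of similar products."""
    num = den = 0.0
    for rated, rating in user_ratings.items():
        sim = jaccard(attributes[product], attributes[rated])
        num += sim * rating
        den += sim
    return num / den if den else None

def hybrid_score(product, user_ratings, cf_prediction=None):
    """Prefer the collaborative prediction; fall back to content when it is missing."""
    if cf_prediction is not None:
        return cf_prediction
    return content_score(product, user_ratings)

print(hybrid_score("book_new", user_ratings))
```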

Data Sparsity:

The user-item rating matrix is very sparse (many null entries) because stores have many products and most of those products are rated by only a few users. In fact, very few people rate products at all. Think about how many times you have rated a product after buying it online. This sparsity makes training computationally inefficient.

Solution:

Use dimensionality reduction. Remove the users and products from which we are not learning much, and so reduce the sparsity of the user-item rating matrix.
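As a small illustration of the pruning step, the sketch below measures sparsity and then drops products and users with too few ratings. The data and the thresholds `min_user` and `min_product` are assumptions chosen for the example; real cut-offs depend on your dataset.

```python
# Hypothetical ratings: user -> {product: rating}
ratings = {
    "u1": {"p1": 5, "p2": 3, "p3": 4},
    "u2": {"p1": 4, "p2": 2},
    "u3": {"p1": 1, "p3": 5},
    "u4": {"p4": 3},  # one rating, on a product nobody else rated
}

def sparsity(ratings):
    """Fraction of empty cells in the user-item matrix."""
    products = {p for r in ratings.values() for p in r}
    total = len(ratings) * len(products)
    filled = sum(len(r) for r in ratings.values())
    return 1 - filled / total if total else 0.0

def prune(ratings, min_user=2, min_product=2):
    """Drop rarely rated products, then users left with too few ratings (single pass)."""
    counts = {}
    for r in ratings.values():
        for p in r:
            counts[p] = counts.get(p, 0) + 1
    kept = {}
    for user, r in ratings.items():
        r = {p: v for p, v in r.items() if counts[p] >= min_product}
        if len(r) >= min_user:
            kept[user] = r
    return kept

pruned = prune(ratings)
print(sparsity(ratings), sparsity(pruned))
```

The trade-off is that pruned users and products get no collaborative recommendations at all, so they need another path, such as a content-based one.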

Grey-Sheep Problem:

Now here comes a weird person in our app. As the name grey sheep suggests, his behavior is unpredictable. He may say Game Of Thrones 1 is good and Game Of Thrones 2 is the worst. Basically, how do we deal with these unusual people whose opinions are inconsistent?

Solution:

Pure collaborative filtering does not work well for these users, so use content-boosted filtering, as in the cold start problem.
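Before routing someone to the content-based path you first have to spot them. One possible heuristic, sketched here with made-up data and an assumed `max_diff` threshold, is to flag users whose closest neighbour still disagrees with them by a wide margin on the products they both rated.

```python
# Hypothetical ratings: user -> {product: rating on a 1-5 scale}
ratings = {
    "u1":  {"a": 5, "b": 4, "c": 1},
    "u2":  {"a": 5, "b": 5, "c": 2},
    "u3":  {"a": 4, "b": 4, "c": 1},
    "odd": {"a": 1, "b": 5, "c": 5},
}

def mean_abs_diff(u, v):
    """Average rating gap on co-rated products, or None without overlap."""
    common = set(u) & set(v)
    if not common:
        return None
    return sum(abs(u[p] - v[p]) for p in common) / len(common)

def grey_sheep(ratings, max_diff=1.5):
    """Users whose closest neighbour still disagrees by more than max_diff."""
    flagged = []
    for user, r in ratings.items():
        diffs = []
        for other, o in ratings.items():
            if other == user:
                continue
            d = mean_abs_diff(r, o)
            if d is not None:
                diffs.append(d)
        if diffs and min(diffs) > max_diff:
            flagged.append(user)
    return flagged

print(grey_sheep(ratings))
```

Flagged users can then be served by the content-based part of the system instead of by their neighbours' ratings.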

Synonymy:

How will you deal with products that are practically the same but listed as different items? For example, different editions of a book, or the PDF and the physical copy of the same book. Since you don’t use product descriptions in collaborative filtering, you can miss the information about synonymy. And since online stores assign different codes to these items, finding synonymy can be a problem.

Solution:

Latent collaborative filtering is a type of algorithm that can identify hidden factors in the data. It also works really well for synonymy. So if we have a lot of synonymous items, this is the way to go.
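For readers who want to see what "hidden factors" means in code, below is a minimal matrix factorization sketch trained with stochastic gradient descent. The ratings, the item names (two editions of one book plus an unrelated cookbook), and every hyperparameter are assumptions for illustration; in practice you would more likely use an existing library than hand-rolled SGD.

```python
import random

# Hypothetical (user, item, rating) triples; two items are editions of the same book
ratings = [
    ("u1", "got_paperback", 5), ("u1", "cookbook", 1),
    ("u2", "got_ebook", 5),     ("u2", "cookbook", 2),
    ("u3", "got_paperback", 4), ("u3", "got_ebook", 5),
    ("u4", "cookbook", 5),      ("u4", "got_ebook", 1),
]

def factorize(ratings, n_factors=2, lr=0.01, reg=0.02, epochs=500, seed=0):
    """Learn user (P) and item (Q) latent vectors with regularized SGD."""
    rng = random.Random(seed)
    P = {u: [rng.random() for _ in range(n_factors)] for u, _, _ in ratings}
    Q = {i: [rng.random() for _ in range(n_factors)] for _, i, _ in ratings}
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - sum(pu * qi for pu, qi in zip(P[u], Q[i]))
            for k in range(n_factors):
                pu, qi = P[u][k], Q[i][k]
                P[u][k] += lr * (err * qi - reg * pu)
                Q[i][k] += lr * (err * pu - reg * qi)
    return P, Q

def rmse(ratings, P, Q):
    """Root mean squared error of the reconstructed ratings."""
    se = sum((r - sum(a * b for a, b in zip(P[u], Q[i]))) ** 2
             for u, i, r in ratings)
    return (se / len(ratings)) ** 0.5

P, Q = factorize(ratings)
print(rmse(ratings, P, Q))
print(Q["got_paperback"], Q["got_ebook"])
```

Items are compared through their learned vectors in `Q` rather than through their store codes, which is why editions that attract the same kind of users tend to end up close together even though nothing tells the model they are the same book.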

Shilling Attacks:

How do you deal with people who are trying to game the recommendation system? For example, our system has a weird author who gave tons of positive ratings to his own books and tons of negative ratings to other people’s books.

Solution:

Take precautions and monitor user behavior.
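Monitoring can start with something as simple as the sketch below, which flags accounts that rate a lot and almost always give the lowest or highest score. The data and the thresholds (`min_ratings`, `extreme_share`) are assumptions, and this heuristic is only a first filter: a flagged account still needs a human look, since some honest users also rate in extremes.

```python
# Hypothetical ratings: user -> {product: rating on a 1-5 scale}
ratings = {
    "weird_author": {"his_book_1": 5, "his_book_2": 5,
                     "rival_1": 1, "rival_2": 1, "rival_3": 1, "rival_4": 1},
    "regular_user": {"his_book_1": 3, "rival_1": 4, "rival_2": 2,
                     "rival_3": 5, "rival_4": 3, "his_book_2": 4},
    "casual_user":  {"rival_1": 5, "rival_2": 1},
}

def flag_suspicious(ratings, min_ratings=5, extreme_share=0.9, low=1, high=5):
    """Flag active users whose ratings are almost all at the extremes."""
    flagged = []
    for user, user_ratings in ratings.items():
        values = list(user_ratings.values())
        if len(values) < min_ratings:
            continue  # too little activity to judge
        extreme = sum(1 for v in values if v in (low, high))
        if extreme / len(values) >= extreme_share:
            flagged.append(user)
    return flagged

print(flag_suspicious(ratings))
```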

I will discuss recommendation systems further in my upcoming articles. If you like this post, don’t forget to clap and follow me on Medium and on Twitter.

Written by Rabin Poudyal

Software Engineer, Data Science Practitioner. Say "Hi!" via email: rabinpoudyal1995@gmail.com or visit my website https://rabinpoudyal.com.np
