Two hyperparameters every ML engineer should care about

Rabin Poudyal
2 min read · Nov 11, 2020

Building a successful ML model involves tuning and tweaking hyperparameters until you find the combination that works best for the dataset you are working with. But among them, a few appear in almost every machine learning or deep learning algorithm and play an outsized role. Without a good idea of how to tune them, it is often impossible to build a model with good accuracy.

1. Learning Rate (alpha)

An artificial neural network is trained with an optimization algorithm such as gradient descent, stochastic gradient descent (SGD), or Adam. The objective of these algorithms is to find the global minimum of a convex loss function. To reach the global minimum, we move downhill along the negative gradient, and we can choose how aggressively to step on each move. That step size is the learning rate (alpha). The more aggressively we step, the faster we approach the minimum; in other words, we can train the model faster, but the downside is that we can overshoot the minimum.

Animation source: https://gfycat.com/angryinconsequentialdiplodocus
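
To make the trade-off concrete, here is a minimal sketch of gradient descent on a toy convex function, f(w) = (w - 3)², with the learning rate as the only knob. The function, step counts, and alpha values are illustrative assumptions, not from the original story:

```python
# A minimal sketch: gradient descent on the toy convex function
# f(w) = (w - 3)**2, whose global minimum is at w = 3.
def gradient_descent(alpha, steps=25, w=0.0):
    for _ in range(steps):
        grad = 2 * (w - 3)   # derivative f'(w)
        w -= alpha * grad    # step downhill, scaled by the learning rate
    return w

# A small alpha converges slowly, a moderate alpha converges quickly,
# and too large an alpha overshoots the minimum and diverges.
for alpha in (0.01, 0.1, 1.1):
    print(f"alpha={alpha}: w after 25 steps = {gradient_descent(alpha):.4f}")
```

Running this, the smallest rate is still far from 3 after 25 steps, the middle rate lands essentially on the minimum, and the largest rate bounces past it further on every step, which is exactly the overshooting behavior described above.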

2. Batch Size

The batch size is also an important hyperparameter that determines the number of samples that…
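
The story is cut off here, but under the standard definition (the batch size is the number of training samples used to compute each gradient update), a minimal sketch of one epoch of mini-batch iteration might look like this. The data shapes and batch size are illustrative assumptions:

```python
import numpy as np

# Toy data: 1,000 samples with 10 features each (illustrative only).
rng = np.random.default_rng(seed=42)
X = rng.normal(size=(1000, 10))
y = rng.normal(size=1000)

batch_size = 32  # number of samples per gradient update

# One epoch of mini-batch iteration: the model's weights would be updated
# once per batch, rather than once per sample (batch_size=1, pure SGD) or
# once per epoch (batch_size=len(X), full-batch gradient descent).
for start in range(0, len(X), batch_size):
    X_batch = X[start:start + batch_size]
    y_batch = y[start:start + batch_size]
    # ... compute the gradient on (X_batch, y_batch) and update weights here
```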



Written by Rabin Poudyal

Software Engineer, Data Science Practitioner. Say "Hi!" via email: rabinpoudyal1995@gmail.com or visit my website https://rabinpoudyal.com.np
