Machine Learning Interview Questions

Review this list of 68 machine learning interview questions and answers verified by hiring managers and candidates.


Design an ETA System for a Maps App.
Machine Learning
System Design
1 answer
Clayton P. - "I've watched all the ML system design interviews, and this solution provides a clean baseline for predicting ETA using historical averages, but it falls short of addressing the broader problem of route planning. The system predicts ETA for a given segment and time interval, but it doesn't explain how to compute the ETA for an entire route or how to integrate this into dynamic path selection. It also lacks depth on handling real-time data, adapting to distribution shift, or reacting to sudden..."
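One possible way to go from per-segment predictions to a whole-route ETA, which the answer above says the baseline leaves out, is to walk the route segment by segment and look up each segment's historical average for the time bucket in which the driver is expected to enter it. This is only a sketch under stated assumptions: the function name `route_eta`, the `(segment, bucket)` lookup table, and the 15-minute bucket size are all hypothetical and do not come from the original solution.

```python
def route_eta(segments, depart_minute, avg_travel_min, bucket_minutes=15):
    """Sum per-segment historical averages, advancing the clock as we go.

    segments:        ordered list of segment ids along the route (hypothetical ids)
    depart_minute:   departure time in minutes since midnight
    avg_travel_min:  dict mapping (segment_id, time_bucket) -> average minutes
    """
    clock = float(depart_minute)
    for seg in segments:
        # Use the bucket for the time we expect to ENTER this segment,
        # not the departure time, so later segments see later traffic.
        bucket = int(clock // bucket_minutes)
        clock += avg_travel_min[(seg, bucket)]
    return clock - depart_minute


# Hypothetical historical averages for two segments and two time buckets.
history = {("A", 0): 10.0, ("A", 1): 20.0, ("B", 0): 5.0, ("B", 1): 8.0}
print(route_eta(["A", "B"], depart_minute=0, avg_travel_min=history))
print(route_eta(["A", "B"], depart_minute=10, avg_travel_min=history))
```

A production system would also need a fallback for missing `(segment, bucket)` keys and a way to blend in real-time speeds; neither is shown here.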
Asked at Amazon, Meta (Facebook), LinkedIn + 1 more • a month ago
Implement k-means clustering.
Machine Learning Engineer
Machine Learning
+5 more
2 answers
Dinesh K. - "I don't know."
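The answer shown for this question does not include an implementation. For reference, here is a minimal sketch of Lloyd's algorithm in NumPy, assuming Euclidean distance and random initialization from the data points. The function name `kmeans` and its parameters are illustrative, not taken from the interview.

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    """Lloyd's algorithm: alternate nearest-centroid assignment and centroid update."""
    rng = np.random.default_rng(seed)
    # Initialize centroids with k distinct data points chosen at random.
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)

    def assign(cents):
        # Pairwise distances of shape (n_samples, k), then nearest centroid per row.
        dists = np.linalg.norm(X[:, None, :] - cents[None, :, :], axis=2)
        return dists.argmin(axis=1)

    for _ in range(n_iters):
        labels = assign(centroids)
        new_centroids = centroids.copy()
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                new_centroids[j] = members.mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break  # assignments can no longer change
        centroids = new_centroids
    return centroids, assign(centroids)


X = np.array([[0, 0], [1, 1], [2, 2], [20, 20], [21, 21], [22, 22]], dtype=float)
centroids, labels = kmeans(X, k=2)
print(centroids, labels)
```

The result depends on initialization in general; running several seeds and keeping the lowest within-cluster sum of squares is a common follow-up.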
Asked at Pinterest • 2 years ago
Implement a k-nearest neighbors algorithm.
IDE
Easy
Machine Learning Engineer
Machine Learning
+1 more
6 answers
Dinar M. - "An even faster, vectorized version: use np.linalg.norm to avoid the loop and np.argpartition to select the k smallest distances. We don't need to sort the whole array; we only need the first k elements to be no larger than the rest.

import numpy as np

def knn(X_train, y_train, X_new, k):
    # Euclidean distance from X_new to every training row
    distances = np.linalg.norm(X_train - X_new, axis=1)
    # O(N) selection instead of an O(N log N) sort; k - 1 keeps k == N valid
    k_indices = np.argpartition(distances, k - 1)[:k]
    # Majority vote, assuming binary 0/1 labels
    return int(np.sum(y_train[k_indices]) > k / 2.0)"
Predict Harmful Text
Hard
Machine Learning
Coding
Asked at Nvidia, OpenAI • 9 months ago
What is overfitting or underfitting? Which models are most likely to experience this, and why?
Machine Learning Engineer
Machine Learning
+2 more
4 answers
Jyoti V. - "Overfitting occurs when a model fails to generalize to new data and has high variance, whereas an underfitting model cannot uncover the underlying pattern in the training data and has high bias. Tree-based models such as decision trees and random forests are likely to overfit, whereas linear models such as linear regression and logistic regression tend to underfit. There are many reasons why a random forest can overfit easily: 1. The model has grown to its full depth a..."
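The bias and variance contrast in the answer above can be illustrated with a small experiment. The sketch below uses polynomial degree as a stand-in for model capacity (an assumption made for illustration; the answer itself talks about linear models and trees) and compares training and test error on synthetic data.

```python
import numpy as np

rng = np.random.default_rng(0)

def target(x):
    # Synthetic ground truth, chosen only for illustration.
    return np.sin(3 * x)

x_train = rng.uniform(-1, 1, 20)
x_test = rng.uniform(-1, 1, 200)
y_train = target(x_train) + rng.normal(0, 0.2, x_train.shape)
y_test = target(x_test) + rng.normal(0, 0.2, x_test.shape)

def fit_and_score(degree):
    """Least-squares polynomial fit; returns (train MSE, test MSE)."""
    coefs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coefs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coefs, x_test) - y_test) ** 2)
    return train_mse, test_mse

scores = {d: fit_and_score(d) for d in (1, 3, 9)}
for d, (tr, te) in scores.items():
    print(f"degree={d}: train MSE={tr:.3f}, test MSE={te:.3f}")
```

Training error can only go down as the degree grows, because each higher-degree model contains the lower-degree ones. An underfit model typically shows high error on both sets, while an overfit model shows a widening gap between training and test error.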


Asked at Dropbox • a year ago
Design a recommender system feature for Dropbox that suggests files to users when they open the app on their phone.
Machine Learning Engineer
Machine Learning
+1 more
In K-Nearest Neighbors (KNN), does setting k=1 lead to higher variance or higher bias?
Machine Learning
Concept
5 answers
Taha U. - "In detail: setting k=1 in KNN makes the model fit the training data very closely, capturing much of the noise in the data and producing a model that may not generalize well to unseen data. This is a high-variance scenario."
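The high-variance claim above can be checked empirically. The following sketch, which assumes synthetic 2D data with 20% label noise and binary 0/1 labels (all choices made for illustration), compares k=1 against a larger k on training and held-out accuracy.

```python
import numpy as np

def knn_predict(X_train, y_train, X_query, k):
    """Majority-vote KNN for binary 0/1 labels using Euclidean distance."""
    dists = np.linalg.norm(X_query[:, None, :] - X_train[None, :, :], axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    return (y_train[nearest].mean(axis=1) > 0.5).astype(int)

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] > 0).astype(int)      # true rule: sign of the first feature
flip = rng.random(200) < 0.2       # flip 20% of labels to simulate noise
y[flip] = 1 - y[flip]

X_train, y_train = X[:100], y[:100]
X_test, y_test = X[100:], y[100:]

results = {}
for k in (1, 15):  # odd k avoids tied votes
    train_acc = np.mean(knn_predict(X_train, y_train, X_train, k) == y_train)
    test_acc = np.mean(knn_predict(X_train, y_train, X_test, k) == y_test)
    results[k] = (train_acc, test_acc)
    print(f"k={k}: train acc={train_acc:.2f}, test acc={test_acc:.2f}")
```

With k=1 every training point is its own nearest neighbor, so training accuracy is perfect even though a fifth of the labels are noise. That memorization of noise is what the answer means by high variance.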
Find Statistical Evidence for Conversion Rate
Medium
Machine Learning
Coding
1 answer
Frank A. - "1) Create the experimental and control groups.
2) Calculate the conversion rate (the proportion, i.e. the mean) for each group using the convert column, which counts True as 1 and False as 0.
3) Calculate the test statistic by subtracting the two proportions and standardizing the difference.
4) Get the p-value and compare it with 0.05.
5) Conclude that the difference is statistically significant if the p-value is less than 0.05; otherwise, conclude there is no statistically significant difference."
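The steps above correspond to a two-proportion z-test. A minimal sketch using only the standard library follows; it assumes the group sizes and conversion counts have already been computed, a pooled standard error, and a two-sided alternative. The function name and the example counts are hypothetical.

```python
import math

def two_proportion_ztest(conv_control, n_control, conv_treat, n_treat):
    """Return (z, two-sided p-value) for the difference in conversion rates."""
    p_control = conv_control / n_control
    p_treat = conv_treat / n_treat
    # Pooled rate under the null hypothesis that both groups convert equally.
    p_pool = (conv_control + conv_treat) / (n_control + n_treat)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_control + 1 / n_treat))
    z = (p_treat - p_control) / se
    # Two-sided p-value: 2 * (1 - Phi(|z|)) == erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 10% vs 13% conversion with 2,000 users per group.
z, p_value = two_proportion_ztest(200, 2000, 260, 2000)
print(f"z={z:.3f}, p={p_value:.4f}, significant at 0.05: {p_value < 0.05}")
```

With a DataFrame, the counts would come from summing and counting the boolean convert column per group.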
Predict User App Deletion
Hard
Machine Learning
Coding
1 answer
Abinash S. - "While running the test loop, I am getting an error: RuntimeError: running_mean should contain 28 elements not 38. I think it's caused by a difference between the categorical features in train and test."
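If the answerer's guess is right, one-hot encoding train and test separately produced different column counts, which a batch-norm layer sized on the training features would reject. A common remedy is to fit the category vocabulary on the training set only and reuse it for the test set. The sketch below shows that idea in NumPy; the helper names and the `plan` feature are hypothetical and not part of the exercise.

```python
import numpy as np

def fit_vocab(values):
    """Map each category seen in training to a fixed column index."""
    return {cat: i for i, cat in enumerate(sorted(set(values)))}

def one_hot(values, vocab):
    """Encode with the training vocabulary; unseen categories become all-zero rows."""
    out = np.zeros((len(values), len(vocab)), dtype=np.float32)
    for row, cat in enumerate(values):
        col = vocab.get(cat)
        if col is not None:
            out[row, col] = 1.0
    return out


train_plans = ["free", "pro", "free", "team"]
test_plans = ["pro", "enterprise"]  # "enterprise" never appears in training

vocab = fit_vocab(train_plans)
X_train = one_hot(train_plans, vocab)
X_test = one_hot(test_plans, vocab)
print(X_train.shape, X_test.shape)  # same number of columns for both splits
```

Because both splits share one vocabulary, the feature width seen by the model is identical at train and test time.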
Asked at Capital One • 2 years ago
How do you stay up to date with advancements in machine learning?
Machine Learning Engineer
Machine Learning
+1 more
2 answers
Ihuoma remita U. - "Through a combination of online resources, hands-on projects, and community engagement."
Asked at Amazon • 4 years ago
What are common linear regression problems?
Data Scientist
Machine Learning
+2 more
1 answer
Ilnur I. - "I'll try to summarize their discussion as I remember it. Linear regression is a method for predicting a target (Y) from features (X). The prediction is a linear function of the features. The aim is to choose the coefficients (theta) of the prediction function so that the difference between target and prediction is smallest on average. This difference between target and prediction is called the loss function. The form of this loss function can depend on the particular real..."
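As a concrete instance of the description above, the sketch below picks the coefficients that minimize the mean squared difference between target and prediction. It assumes squared error as the loss and uses synthetic, noise-free data so the fitted coefficients can be compared with known ones; both choices are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 2))

# Hypothetical ground-truth coefficients: intercept, then one slope per feature.
true_theta = np.array([1.0, 2.0, -3.0])

# Prepend a column of ones so the intercept is learned like any other coefficient.
X_b = np.column_stack([np.ones(len(X)), X])
y = X_b @ true_theta  # noise-free targets, for illustration only

# Least squares: choose theta to minimize the mean of (X_b @ theta - y) ** 2.
theta, *_ = np.linalg.lstsq(X_b, y, rcond=None)
mse = np.mean((X_b @ theta - y) ** 2)
print(theta, mse)
```

Other loss functions, such as absolute error, lead to different coefficients and usually need an iterative solver rather than a closed-form one.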
Design an ML monitoring system for a fantasy sports app, focusing on drift, performance, outliers, and quality.
Machine Learning Engineer
Machine Learning
+1 more
1 answer
L B. - "For data distribution drift: KL divergence or PSI (Population Stability Index). Performance falls into two categories: first, operational metrics such as runtime; second, model performance: loss function, MAE (for regression), and business metrics such as overall watch time, DAU, and revenue lift. Outliers: data distribution."
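To make the PSI suggestion above concrete, here is a minimal NumPy sketch. It assumes quantile bins computed from the reference (training-time) sample and a small floor on bin fractions to avoid taking the log of zero; the function name and defaults are illustrative.

```python
import numpy as np

def psi(reference, current, n_bins=10, eps=1e-6):
    """Population Stability Index between a reference sample and a current sample."""
    reference = np.asarray(reference, dtype=float)
    current = np.asarray(current, dtype=float)
    # Interior bin edges from reference quantiles; the outer bins are open-ended.
    inner_edges = np.quantile(reference, np.linspace(0, 1, n_bins + 1)[1:-1])
    ref_idx = np.searchsorted(inner_edges, reference, side="right")
    cur_idx = np.searchsorted(inner_edges, current, side="right")
    ref_frac = np.bincount(ref_idx, minlength=n_bins) / len(reference)
    cur_frac = np.bincount(cur_idx, minlength=n_bins) / len(current)
    # Floor the fractions so empty bins do not produce log(0) or division by zero.
    ref_frac = np.clip(ref_frac, eps, None)
    cur_frac = np.clip(cur_frac, eps, None)
    return float(np.sum((cur_frac - ref_frac) * np.log(cur_frac / ref_frac)))


rng = np.random.default_rng(0)
train_feature = rng.normal(0, 1, 5000)
same_dist = rng.normal(0, 1, 5000)
shifted = rng.normal(1, 1, 5000)   # mean shifted by one standard deviation
print(psi(train_feature, same_dist), psi(train_feature, shifted))
```

A frequently quoted rule of thumb treats PSI below 0.1 as stable and above 0.25 as a significant shift, though suitable thresholds depend on the feature and the product.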
Implement a 2D Convolutional Filter
Medium
Machine Learning
1 answer
Abinash S. - "I checked, and the unit test fails with a False assertion, as you can see in the Colab notebook below.

FAIL: test_simple (__main__.Conv2dTest)
Traceback (most recent call last):
  File "", line 19, in test_simple
    self.assertTrue(torch.equal(output, torch.tensor([[[[ 5., 1.], [ -2., -10.]]]])))
AssertionError: False is not true"
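For reference, a 2D convolutional filter in the deep-learning sense (cross-correlation, with no kernel flip) can be sketched in NumPy as below. This assumes a single-channel input, no padding, and a square stride. It is not the exercise's reference solution and is not claimed to reproduce the expected tensor in the failing test above, whose input is not shown.

```python
import numpy as np

def conv2d(image, kernel, stride=1):
    """Valid cross-correlation of a 2D image with a 2D kernel."""
    kh, kw = kernel.shape
    out_h = (image.shape[0] - kh) // stride + 1
    out_w = (image.shape[1] - kw) // stride + 1
    out = np.zeros((out_h, out_w), dtype=float)
    for i in range(out_h):
        for j in range(out_w):
            # Window of the image currently covered by the kernel.
            patch = image[i * stride:i * stride + kh, j * stride:j * stride + kw]
            out[i, j] = np.sum(patch * kernel)
    return out


image = np.arange(9, dtype=float).reshape(3, 3)
kernel = np.array([[1.0, 2.0], [3.0, 4.0]])
print(conv2d(image, kernel))
```

A mathematical convolution would flip the kernel along both axes first; mixing up the two conventions is one possible cause of a mismatch against an expected tensor.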
What are the advantages and limitations of linear regression?
Machine Learning
Concept
How can we tell when a model needs to be refreshed?
Machine Learning
Concept
How would you handle an exploding gradient, given a neural network?
Machine Learning
Concept
What's the importance of feature scaling and normalization?
Machine Learning
Concept
Explain training and testing data.
Machine Learning
Concept
What's the difference between classification and regression?
Machine Learning
Concept
Asked at Scale AI • 2 years ago
Describe the distribution between the 5th and 6th points in an interval from 0 to 1 containing ten uniformly distributed points.
Machine Learning Engineer
Machine Learning
+1 more