Machine Learning Interview Questions

Review this list of 68 machine learning interview questions and answers verified by hiring managers and candidates.

+ Add interview

Product

Engineering

Operations

Design

Marketing

Data

Sales

Finance

Consulting

Security

Add interview

Machine Learning Engineer Data Scientist Technical Program Manager Software Engineer Product Manager Engineering Manager

Recommend similar artists on Spotify.
Machine Learning
System Design
Add answer I was asked this
Machine Learning
System Design
Asked at Amazon • 3 years ago
Design an app where customer listening history is collected after they have listened to a song for more than 30 seconds for analytics use.
Technical Program Manager
Machine Learning
+1 more
1 answer I was asked this
"I gave multiple answers including polling the service every 10 sec to see customer. Or we can have the client side call which will send this data after 10 sec to us. We will store in dynamo DB and then send through pipelines to redshift DB for analytics."
Deepti K. - "I gave multiple answers including polling the service every 10 sec to see customer. Or we can have the client side call which will send this data after 10 sec to us. We will store in dynamo DB and then send through pipelines to redshift DB for analytics."See full answer
Technical Program Manager
Machine Learning
+1 more
Asked at Samsung • 2 years ago
Design a system that filters waste by detecting paper and putting it in the correct bin.
Machine Learning Engineer
Machine Learning
+1 more
1 answer I was asked this
"I've worked on projects not quite like this, but very similar, in the past - I'll borrow from that to answer this: The Broader Context this problem doesn't specify the type of data we're working with, or how it's being ingested to align with my personal background, I'll assume a picture that lends this problem well to being a computer vision (abbreviated "CV") related question: let's say we have a conveyor belt in a waste facility, which sequentially carries a stream of waste w"
Zain R. - "I've worked on projects not quite like this, but very similar, in the past - I'll borrow from that to answer this: The Broader Context this problem doesn't specify the type of data we're working with, or how it's being ingested to align with my personal background, I'll assume a picture that lends this problem well to being a computer vision (abbreviated "CV") related question: let's say we have a conveyor belt in a waste facility, which sequentially carries a stream of waste w"See full answer
Machine Learning Engineer
Machine Learning
+1 more
Asked at Amazon, TikTok • 2 years ago
Explain bias-variance tradeoff.
Machine Learning Engineer
Machine Learning
+1 more
Add answer I was asked this
Machine Learning Engineer
Machine Learning
+1 more
Explain gradient descent.
Machine Learning
Concept
Add answer I was asked this
Machine Learning
Concept

🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

Asked at SAP • 2 years ago
You have a dataset comprising 1,000 avatar images and 100,000 user descriptions with associated avatar images. Create a model that recommends an image from a new set of 100,000 images for a user de...
Data Scientist
Machine Learning
2 answers I was asked this
"[I'm not sure whether the answer below is the best, as I have not gotten result and feedback from my interview] Ans: I would solve by first using a VAE-style model, to create a latent space embedding that translates user description to generate images. Training would be done on the 1000 avatar images and 100000 descriptions, following this scheme: VAE: description -> encoder -> latent space -> decoder -> image Q: "OK, but that means you're limiting the generated images to be only the 1000 imag"
Nick S. - "[I'm not sure whether the answer below is the best, as I have not gotten result and feedback from my interview] Ans: I would solve by first using a VAE-style model, to create a latent space embedding that translates user description to generate images. Training would be done on the 1000 avatar images and 100000 descriptions, following this scheme: VAE: description -> encoder -> latent space -> decoder -> image Q: "OK, but that means you're limiting the generated images to be only the 1000 imag"See full answer
Data Scientist
Machine Learning
Asked at Capital One, Google, Netflix + 1 more • 2 years ago
Tell me about a machine learning project you worked on.
Machine Learning Engineer
Machine Learning
Add answer I was asked this
Machine Learning Engineer
Machine Learning
Asked at Google • 3 years ago
You received user feedback that Yelp's restaurant recommendation is not personal enough. What would you do?
Product Manager
Machine Learning
+2 more
1 answer I was asked this
"Zero in on the problem, the expectations of user are to find a restaurant but their feed is uninspired so they may bounce out of Yelp. Identify the impact size of user feeling like discovery is not personalised enough by seeing % of users that selected a restaurant from the homepage If large enough, I will look at who is likely the ones that want personalisation and why? Do they feel like they want to try new restaurants or are they finding it difficult to find restaurants they have been"
Chermaine Y. - "Zero in on the problem, the expectations of user are to find a restaurant but their feed is uninspired so they may bounce out of Yelp. Identify the impact size of user feeling like discovery is not personalised enough by seeing % of users that selected a restaurant from the homepage If large enough, I will look at who is likely the ones that want personalisation and why? Do they feel like they want to try new restaurants or are they finding it difficult to find restaurants they have been"See full answer
Product Manager
Machine Learning
+2 more
How do you ensure a deployed model remains up to date?
Machine Learning
System Design
1 answer I was asked this
"We should define automated Scheduled Updates for deployment to keep the application up to date."
Rahul M. - "We should define automated Scheduled Updates for deployment to keep the application up to date."See full answer
Machine Learning
System Design
Is the Mean Squared Error (MSE) a suitable cost function for a logistic regression model?
Machine Learning
Concept
1 answer I was asked this
"No ,MSE is suitable for only regression modes. Although the logistic regression in Its name has regression , but it is a classification problem so MSE is not suitable for classification models like logistic regression."
1036 loknadh R. - "No ,MSE is suitable for only regression modes. Although the logistic regression in Its name has regression , but it is a classification problem so MSE is not suitable for classification models like logistic regression."See full answer
Machine Learning
Concept
Which is more likely to result in overfitting a model? More training data, decreased nodes in neural network hidden layers, eliminating sparse features, or using a Gaussian or RBF kernel in SVM?
Machine Learning
Concept
1 answer I was asked this
"Switching from a linear kernel to RBF / Gaussian kernel is likely to result in overfitting the model. It is a move that adds complexity to the mix, and if the data doesn't need that sort of complexity, it would result in overfitting. On the other hand, all the other three approaches would only try too reduce complexity in the process, thereby doesn't contribute to overfitting the model."
Sri V. - "Switching from a linear kernel to RBF / Gaussian kernel is likely to result in overfitting the model. It is a move that adds complexity to the mix, and if the data doesn't need that sort of complexity, it would result in overfitting. On the other hand, all the other three approaches would only try too reduce complexity in the process, thereby doesn't contribute to overfitting the model."See full answer
Machine Learning
Concept
Asked at Amazon • 2 years ago
Tell me about a machine learning project you worked on and its outcome.
Machine Learning Engineer
Machine Learning
Add answer I was asked this
Machine Learning Engineer
Machine Learning
Asked at Capital One • 2 years ago
Tell me about a time when you worked with a difficult team member in a research setting.
Machine Learning Engineer
Machine Learning
Add answer I was asked this
Machine Learning Engineer
Machine Learning
What is the significance of an area under the curve (AUC) equal to 0.5?
Machine Learning
Concept
1 answer I was asked this
"AUC 0.5 equates to a random model, so when creating any machine learning model or statistical model, you ideally want your model to at least beat this random baseline."
Harsh S. - "AUC 0.5 equates to a random model, so when creating any machine learning model or statistical model, you ideally want your model to at least beat this random baseline."See full answer
Machine Learning
Concept
Describe how the split in a decision tree works.
Data Scientist
Machine Learning
+1 more
Add answer I was asked this
Data Scientist
Machine Learning
+1 more
Asked at Microsoft • 2 years ago
Tell me about a machine learning research topic you find interesting.
Machine Learning Engineer
Machine Learning
Add answer I was asked this
Machine Learning Engineer
Machine Learning
Describe how random forests work and explain what an ensemble method is.
Machine Learning
Concept
1 answer I was asked this
"Random Forest is a machine learning model used for classification problems or regression problems. It can handle binary classification as well as multi-class classification. It is a very efficient model and is great for a baseline or used in a service that needs extremely low latency depending on the size of the model. It's also a good option for wide datasets (dataset with many features) due to it's random subset of features. it is slightly less optimized for deep datasets on very large dataset"
Jake M. - "Random Forest is a machine learning model used for classification problems or regression problems. It can handle binary classification as well as multi-class classification. It is a very efficient model and is great for a baseline or used in a service that needs extremely low latency depending on the size of the model. It's also a good option for wide datasets (dataset with many features) due to it's random subset of features. it is slightly less optimized for deep datasets on very large dataset"See full answer
Machine Learning
Concept
Describe the model you would build to predict fraudulent credit card transactions.
Machine Learning
Concept
Add answer I was asked this
Machine Learning
Concept
Explain linear regression to a non-technical stakeholder.
Machine Learning
Concept
Add answer I was asked this
Machine Learning
Concept
What metrics would you consider to assess your model's performance?
Machine Learning
Concept
Add answer I was asked this
Machine Learning
Concept

Showing 41-60 of 68

Interviewed recently?

Help improve our question database (and earn karma) by telling us about your experience

Trending companies

Machine Learning Interview Questions

Recommend similar artists on Spotify.

Design an app where customer listening history is collected after they have listened to a song for more than 30 seconds for analytics use.

Design a system that filters waste by detecting paper and putting it in the correct bin.

Explain bias-variance tradeoff.

Explain gradient descent.

You have a dataset comprising 1,000 avatar images and 100,000 user descriptions with associated avatar images. Create a model that recommends an image from a new set of 100,000 images for a user de...

Tell me about a machine learning project you worked on.

You received user feedback that Yelp's restaurant recommendation is not personal enough. What would you do?

How do you ensure a deployed model remains up to date?

Is the Mean Squared Error (MSE) a suitable cost function for a logistic regression model?

Which is more likely to result in overfitting a model? More training data, decreased nodes in neural network hidden layers, eliminating sparse features, or using a Gaussian or RBF kernel in SVM?

Tell me about a machine learning project you worked on and its outcome.

Tell me about a time when you worked with a difficult team member in a research setting.

What is the significance of an area under the curve (AUC) equal to 0.5?

Describe how the split in a decision tree works.

Tell me about a machine learning research topic you find interesting.

Describe how random forests work and explain what an ensemble method is.

Describe the model you would build to predict fraudulent credit card transactions.

Explain linear regression to a non-technical stakeholder.

What metrics would you consider to assess your model's performance?

Explore questions by company

Explore questions by role