Machine Learning Interview Questions

Review this list of 84 machine learning interview questions and answers verified by hiring managers and candidates.
  • Anthropic logoAsked at Anthropic 
    Machine Learning Engineer
    Machine Learning
    +2 more
  • Machine Learning Engineer
    Machine Learning
    +3 more
  • Amazon logoAsked at Amazon 

    "in simple words, linear regression helps in predicting the value whereas logistics regression helps in predicting the binary classification. But lets talk through some example Linear regression model: E-commerce website pricing recommendation engine is built on linear regression model where we do have some variables such as competitor price, internal economics and consumer demand etc when we put this in a supervised learning model, it helps in predicting prices Logistics regression model"

    Anonymous Aardvark - "in simple words, linear regression helps in predicting the value whereas logistics regression helps in predicting the binary classification. But lets talk through some example Linear regression model: E-commerce website pricing recommendation engine is built on linear regression model where we do have some variables such as competitor price, internal economics and consumer demand etc when we put this in a supervised learning model, it helps in predicting prices Logistics regression model"See full answer

    Machine Learning Engineer
    Machine Learning
    +1 more
  • Meta (Facebook) logoAsked at Meta (Facebook) 
    Video answer for 'Design a fake news detection system.'

    " Functional Requirements Content Ingestion\: Ingest news articles from various sources (websites, social media, etc.). Handle different types of content (text, images, videos). Content Analysis\: Extract and preprocess text from articles. Analyze the content for potential indicators of fake news. Model Training and Prediction\: Use machine learning models to classify content as fake or real. Continuously improve models with new data and f"

    Scott S. - " Functional Requirements Content Ingestion\: Ingest news articles from various sources (websites, social media, etc.). Handle different types of content (text, images, videos). Content Analysis\: Extract and preprocess text from articles. Analyze the content for potential indicators of fake news. Model Training and Prediction\: Use machine learning models to classify content as fake or real. Continuously improve models with new data and f"See full answer

    Technical Program Manager
    Machine Learning
    +3 more
  • Amazon logoAsked at Amazon 
    Machine Learning Engineer
    Machine Learning
    +1 more
  • 🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

  • +1

    "import numpy as np class Centroid: def init(self, location, vectors): self.location = location # (D,) self.vectors = vectors # (N_i, D) class KMeans: def init(self, n_features, k): self.nfeatures = nfeatures self.centroids = [ Centroid( location=np.random.randn(n_features), vectors=np.empty((0, n_features)) ) for _ in range(k) ] def distance(self, x,"

    Dinesh G. - "import numpy as np class Centroid: def init(self, location, vectors): self.location = location # (D,) self.vectors = vectors # (N_i, D) class KMeans: def init(self, n_features, k): self.nfeatures = nfeatures self.centroids = [ Centroid( location=np.random.randn(n_features), vectors=np.empty((0, n_features)) ) for _ in range(k) ] def distance(self, x,"See full answer

    Machine Learning
    Coding
  • TikTok logoAsked at TikTok 
    Machine Learning Engineer
    Machine Learning
    +1 more
  • Machine Learning
    System Design
  • Machine Learning Engineer
    Machine Learning
    +1 more
  • Amazon logoAsked at Amazon 
    Video answer for 'Implement k-means clustering.'

    "i dont know"

    Dinesh K. - "i dont know"See full answer

    Machine Learning Engineer
    Machine Learning
    +5 more
  • TikTok logoAsked at TikTok 

    "class Solution: def lengthOfLIS(self, nums: List[int]) -> int: temp = [nums[0]] for num in nums: if temp[-1]< num: temp.append(num) else: index = bisect_left(temp,num) temp[index] = num return len(temp) "

    Mahima M. - "class Solution: def lengthOfLIS(self, nums: List[int]) -> int: temp = [nums[0]] for num in nums: if temp[-1]< num: temp.append(num) else: index = bisect_left(temp,num) temp[index] = num return len(temp) "See full answer

    Machine Learning Engineer
    Machine Learning
    +1 more
  • Anthropic logoAsked at Anthropic 
    Machine Learning Engineer
    Machine Learning
    +1 more
  • Microsoft logoAsked at Microsoft 
    Video answer for 'How do you select the value of 'k' in the k-means algorithm?'
    +1

    "As an interviewer, I have asked this question to candidates in the past. Here are the major topics I am looking for in an interview The candidate should understand that there are ways of measuring the loss of a particular clustering. For example, we can take the average distance of each point to it's cluster center. The candidate should understand that this loss will always decrease as the number of clusters increases. For that reason, we can't just pick the value of K that minimizes the l"

    Michael F. - "As an interviewer, I have asked this question to candidates in the past. Here are the major topics I am looking for in an interview The candidate should understand that there are ways of measuring the loss of a particular clustering. For example, we can take the average distance of each point to it's cluster center. The candidate should understand that this loss will always decrease as the number of clusters increases. For that reason, we can't just pick the value of K that minimizes the l"See full answer

    Machine Learning Engineer
    Machine Learning
    +1 more
  • "I've watched all the ML Systems designs interviews and this solution provides a clean baseline for predicting ETA using historical averages, but it falls short of addressing the broader problem of route planning. The system predicts ETA for a given segment and time interval, but it doesn’t explain how to compute the ETA for an entire route or how to integrate this into dynamic path selection. It also lacks depth on handling real-time data, adapting to distribution shift, or reacting to sudden"

    Clayton P. - "I've watched all the ML Systems designs interviews and this solution provides a clean baseline for predicting ETA using historical averages, but it falls short of addressing the broader problem of route planning. The system predicts ETA for a given segment and time interval, but it doesn’t explain how to compute the ETA for an entire route or how to integrate this into dynamic path selection. It also lacks depth on handling real-time data, adapting to distribution shift, or reacting to sudden"See full answer

    Machine Learning
    System Design
  • Machine Learning
    System Design
  • Amazon logoAsked at Amazon 
    Video answer for 'Implement a k-nearest neighbors algorithm.'
    +4

    "Even more faster and vectorized version, using np.linalg.norm - to avoid loop and np.argpartition to select lowest k. We dont need to sort whole array - we need to be sure that first k elements are lower than the rest. import numpy as np def knn(Xtrain, ytrain, X_new, k): distances = np.linalg.norm(Xtrain - Xnew, axis=1) k_indices = np.argpartition(distances, k)[:k] # O(N) selection instead of O(N log N) sort return int(np.sum(ytrain[kindices]) > k / 2.0) `"

    Dinar M. - "Even more faster and vectorized version, using np.linalg.norm - to avoid loop and np.argpartition to select lowest k. We dont need to sort whole array - we need to be sure that first k elements are lower than the rest. import numpy as np def knn(Xtrain, ytrain, X_new, k): distances = np.linalg.norm(Xtrain - Xnew, axis=1) k_indices = np.argpartition(distances, k)[:k] # O(N) selection instead of O(N log N) sort return int(np.sum(ytrain[kindices]) > k / 2.0) `"See full answer

    Machine Learning Engineer
    Machine Learning
    +2 more
  • +1

    "Over-fitting of a model occurs when model fails to generalize to any new data and has high variance withing training data whereas in under fitting model isn't able to uncover the underlying pattern in the training data and high bias. Tree based model like decision tree and random forest are likely to overfit whereas linear models like linear regression and logistic regression tends to under fit. There are many reasons why a Random forest can overfits easily 1. Model has grown to its full depth a"

    Jyoti V. - "Over-fitting of a model occurs when model fails to generalize to any new data and has high variance withing training data whereas in under fitting model isn't able to uncover the underlying pattern in the training data and high bias. Tree based model like decision tree and random forest are likely to overfit whereas linear models like linear regression and logistic regression tends to under fit. There are many reasons why a Random forest can overfits easily 1. Model has grown to its full depth a"See full answer

    Machine Learning Engineer
    Machine Learning
    +2 more
  • Machine Learning
    Coding
  • Machine Learning Engineer
    Machine Learning
    +1 more
  • Machine Learning Engineer
    Machine Learning
    +1 more
Showing 21-40 of 84