Skip to main content

Top Machine Learning Engineer Concept Interview Questions

Review this list of 71 Concept Machine Learning Engineer interview questions and answers verified by hiring managers and candidates.
  • OpenAI logoAsked at OpenAI 
    3 answers
    Video answer for 'How is gradient descent and model optimization used in linear regression?'

    "Gradient Descent = core engine for training most ML models It works by iteratively minimizing loss via gradients Many improvements exist (Adam, RMSProp, etc.) Alternatives exist for: Faster convergence Non-differentiable problems Direct metric optimization"

    Dessalew A. - "Gradient Descent = core engine for training most ML models It works by iteratively minimizing loss via gradients Many improvements exist (Adam, RMSProp, etc.) Alternatives exist for: Faster convergence Non-differentiable problems Direct metric optimization"See full answer

    Machine Learning Engineer
    Concept
    +1 more
  • Goldman Sachs logoAsked at Goldman Sachs 
    4 answers
    Video answer for 'Explain Bayes' theorem.'
    +1

    "Is it bad to get the answer a different way? Will they mark that as not knowing Bayes Theorem or just correct as it is an easier way to get the answer? The way I went is to look at what happens when the factory makes 100 light bulbs. Machine A makes 60 of which 3 are faulty, Machine B makes 40 of which 1.2 are faulty. Therefore the pool of faulty lightbulbs is 3/4.2 = 5/7 from machine A and 1.2/4.2 = 3/7 from Machine B."

    Will I. - "Is it bad to get the answer a different way? Will they mark that as not knowing Bayes Theorem or just correct as it is an easier way to get the answer? The way I went is to look at what happens when the factory makes 100 light bulbs. Machine A makes 60 of which 3 are faulty, Machine B makes 40 of which 1.2 are faulty. Therefore the pool of faulty lightbulbs is 3/4.2 = 5/7 from machine A and 1.2/4.2 = 3/7 from Machine B."See full answer

    Machine Learning Engineer
    Concept
    +2 more
  • Microsoft logoAsked at Microsoft 
    5 answers
    Video answer for 'How do you select the value of 'k' in the k-means algorithm?'
    +2

    "As an interviewer, I have asked this question to candidates in the past. Here are the major topics I am looking for in an interview The candidate should understand that there are ways of measuring the loss of a particular clustering. For example, we can take the average distance of each point to it's cluster center. The candidate should understand that this loss will always decrease as the number of clusters increases. For that reason, we can't just pick the value of K that minimizes the l"

    Michael F. - "As an interviewer, I have asked this question to candidates in the past. Here are the major topics I am looking for in an interview The candidate should understand that there are ways of measuring the loss of a particular clustering. For example, we can take the average distance of each point to it's cluster center. The candidate should understand that this loss will always decrease as the number of clusters increases. For that reason, we can't just pick the value of K that minimizes the l"See full answer

    Machine Learning Engineer
    Concept
    +1 more
  • OpenAI logoAsked at OpenAI 
    Add answer
    Video answer for 'How do you select input for modeling if there are features highly correlated with each other?'
    Machine Learning Engineer
    Concept
    +2 more
  • Google logoAsked at Google 
    5 answers
    +2

    "Deep Learning is a part of Artificial Intelligence, it's like teaching the machine to think and make decisions on its own. It's like how we teach a child the concept of an apple - it's round, red, has a stem on top. We show them multiple pictures of apples and then they understand and can recognize an apple in future. Similarly, we feed lots of data to the machine, and slowly, it starts learning from that data, and can then make relevant predictions or decisions based on what it has learnt. A co"

    Surbhi G. - "Deep Learning is a part of Artificial Intelligence, it's like teaching the machine to think and make decisions on its own. It's like how we teach a child the concept of an apple - it's round, red, has a stem on top. We show them multiple pictures of apples and then they understand and can recognize an apple in future. Similarly, we feed lots of data to the machine, and slowly, it starts learning from that data, and can then make relevant predictions or decisions based on what it has learnt. A co"See full answer

    Machine Learning Engineer
    Concept
    +3 more
  • 🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

  • Amazon logoAsked at Amazon 
    4 answers
    +1

    "in simple words, linear regression helps in predicting the value whereas logistics regression helps in predicting the binary classification. But lets talk through some example Linear regression model: E-commerce website pricing recommendation engine is built on linear regression model where we do have some variables such as competitor price, internal economics and consumer demand etc when we put this in a supervised learning model, it helps in predicting prices Logistics regression model"

    Anonymous Aardvark - "in simple words, linear regression helps in predicting the value whereas logistics regression helps in predicting the binary classification. But lets talk through some example Linear regression model: E-commerce website pricing recommendation engine is built on linear regression model where we do have some variables such as competitor price, internal economics and consumer demand etc when we put this in a supervised learning model, it helps in predicting prices Logistics regression model"See full answer

    Machine Learning Engineer
    Concept
    +1 more
  • Google logoAsked at Google 
    5 answers
    +1

    "DNNs can learn hierarchical features, with each layer learning progressively more abstract features, and generalizes better. SNNs are better for simplier problems involving smaller datasets and if low latency is required."

    Louie Z. - "DNNs can learn hierarchical features, with each layer learning progressively more abstract features, and generalizes better. SNNs are better for simplier problems involving smaller datasets and if low latency is required."See full answer

    Machine Learning Engineer
    Concept
    +2 more
  • Snap logoAsked at Snap 
    8 answers
    +4

    "Perplexity - the measure LLM is surprised when it predicts next word. For example: I love to eat --- if LLM selects as next word "fruits" it will be less surprising than if LLM selects as next word "metal". It is better to have lower perplexity score. Cross Entropy is the measure how well model match true labels. So if the next word is "cat" and LLM assigns 0.5 to it then cross entropy value is -log(0.5) = 0.69 and if it assigns 0.9 probability to word cat then cross entropy value is - log(0.9"

    Alex N. - "Perplexity - the measure LLM is surprised when it predicts next word. For example: I love to eat --- if LLM selects as next word "fruits" it will be less surprising than if LLM selects as next word "metal". It is better to have lower perplexity score. Cross Entropy is the measure how well model match true labels. So if the next word is "cat" and LLM assigns 0.5 to it then cross entropy value is -log(0.5) = 0.69 and if it assigns 0.9 probability to word cat then cross entropy value is - log(0.9"See full answer

    Machine Learning Engineer
    Concept
    +2 more
  • Anthropic logoAsked at Anthropic 
    2 answers

    "There are many good answers to this that AI scientists around the world, I and my coworkers have tried over the years. For one, RAG is a great option to fact-check and enforce citation generation, update the data in the knowledge base of the generative AI, etc. "

    Nathan B. - "There are many good answers to this that AI scientists around the world, I and my coworkers have tried over the years. For one, RAG is a great option to fact-check and enforce citation generation, update the data in the knowledge base of the generative AI, etc. "See full answer

    Machine Learning Engineer
    Concept
    +4 more
  • OpenAI logoAsked at OpenAI 
    3 answers

    "The adjusting context window size in LLM change trade off between reasoning capability, accuracy, computation cost. It influence how attention mechanist allocate resources across the input. Longer context window let it you input greater number of words and have more context to generate proper next token. However llms have lost in the middle issue. They remember the beginning of text and end of text but lost information located in the middle of long input. Another problem is Attention Dilution."

    Alex N. - "The adjusting context window size in LLM change trade off between reasoning capability, accuracy, computation cost. It influence how attention mechanist allocate resources across the input. Longer context window let it you input greater number of words and have more context to generate proper next token. However llms have lost in the middle issue. They remember the beginning of text and end of text but lost information located in the middle of long input. Another problem is Attention Dilution."See full answer

    Machine Learning Engineer
    Concept
    +4 more
  • Google logoAsked at Google 
    4 answers
    +1

    "supervised learning: model is trained on the labeled data unsupervised learning: no labels provided - model learns by finding patterns , structure and groupings in the data. Semi-supervised learning: use small set of labels to guide learning for the larger pool of unlabeled data. reinforcement learning: leans by interacting with students the environment, receives reward and penalties based on actions self supervised: no labelled data . The model makes its own practice problems by"

    Anchal V. - "supervised learning: model is trained on the labeled data unsupervised learning: no labels provided - model learns by finding patterns , structure and groupings in the data. Semi-supervised learning: use small set of labels to guide learning for the larger pool of unlabeled data. reinforcement learning: leans by interacting with students the environment, receives reward and penalties based on actions self supervised: no labelled data . The model makes its own practice problems by"See full answer

    Machine Learning Engineer
    Concept
    +1 more
  • Nvidia logoAsked at Nvidia 
    4 answers
    +1

    "Over-fitting of a model occurs when model fails to generalize to any new data and has high variance withing training data whereas in under fitting model isn't able to uncover the underlying pattern in the training data and high bias. Tree based model like decision tree and random forest are likely to overfit whereas linear models like linear regression and logistic regression tends to under fit. There are many reasons why a Random forest can overfits easily 1. Model has grown to its full depth a"

    Jyoti V. - "Over-fitting of a model occurs when model fails to generalize to any new data and has high variance withing training data whereas in under fitting model isn't able to uncover the underlying pattern in the training data and high bias. Tree based model like decision tree and random forest are likely to overfit whereas linear models like linear regression and logistic regression tends to under fit. There are many reasons why a Random forest can overfits easily 1. Model has grown to its full depth a"See full answer

    Machine Learning Engineer
    Concept
    +2 more
  • Anthropic logoAsked at Anthropic 
    4 answers
    +1

    "Hallucinations are evaluated by measuring how often generated outputs contain information that is not supported by trusted sources. what hallucination means in context: Intrinsic hallucination: contradicts provided context Extrinsic hallucination: introduces unsupported facts Fabrication: confidently incorrect answers"

    Hardik saurabh G. - "Hallucinations are evaluated by measuring how often generated outputs contain information that is not supported by trusted sources. what hallucination means in context: Intrinsic hallucination: contradicts provided context Extrinsic hallucination: introduces unsupported facts Fabrication: confidently incorrect answers"See full answer

    Machine Learning Engineer
    Concept
    +4 more
  • Anthropic logoAsked at Anthropic 
    Add answer
    Machine Learning Engineer
    Concept
    +5 more
  • Apple logoAsked at Apple 
    2 answers

    "Hey Grandma, you've had a lot of experience with infants, haven't you? When they were babies, you taught them how to chew in their first six months. This initial phase is like giving them data. Once they learned how to chew, they could handle any food you gave them. Next, you refined their learning by teaching them that they should only chew on food. This is like refining the data so they understand what is relevant. Then, a few months later, they started crawling and walking, learning by observ"

    Hari priya K. - "Hey Grandma, you've had a lot of experience with infants, haven't you? When they were babies, you taught them how to chew in their first six months. This initial phase is like giving them data. Once they learned how to chew, they could handle any food you gave them. Next, you refined their learning by teaching them that they should only chew on food. This is like refining the data so they understand what is relevant. Then, a few months later, they started crawling and walking, learning by observ"See full answer

    Machine Learning Engineer
    Concept
  • Apple logoAsked at Apple 
    Add answer
    Machine Learning Engineer
    Concept
    +1 more
  • Anthropic logoAsked at Anthropic 
    Add answer
    Machine Learning Engineer
    Concept
    +5 more
  • Google logoAsked at Google 
    2 answers

    "Grandma! You know how we can look at a picture and know what's in it—like seeing a cat or a dog? Computers can learn to do that too! It's just they use special tricks and math to see and understand pictures or videos. It helps them figure out what's in the pictures, almost like how we do! Almost like giving it eyes to see the world in its own way!"

    Praveen D. - "Grandma! You know how we can look at a picture and know what's in it—like seeing a cat or a dog? Computers can learn to do that too! It's just they use special tricks and math to see and understand pictures or videos. It helps them figure out what's in the pictures, almost like how we do! Almost like giving it eyes to see the world in its own way!"See full answer

    Machine Learning Engineer
    Concept
  • Machine Learning Engineer
    Concept
    +1 more
  • Apple logoAsked at Apple 
    Add answer
    Machine Learning Engineer
    Concept
Showing 1-20 of 71