Skip to main content

Data Scientist Interview Questions

Review this list of 174 Data Scientist interview questions and answers verified by hiring managers and candidates.
  • "[I'm not sure whether the answer below is the best, as I have not gotten result and feedback from my interview] Ans: I would solve by first using a VAE-style model, to create a latent space embedding that translates user description to generate images. Training would be done on the 1000 avatar images and 100000 descriptions, following this scheme: VAE: description -> encoder -> latent space -> decoder -> image Q: "OK, but that means you're limiting the generated images to be only the 1000 imag"

    Nick S. - "[I'm not sure whether the answer below is the best, as I have not gotten result and feedback from my interview] Ans: I would solve by first using a VAE-style model, to create a latent space embedding that translates user description to generate images. Training would be done on the 1000 avatar images and 100000 descriptions, following this scheme: VAE: description -> encoder -> latent space -> decoder -> image Q: "OK, but that means you're limiting the generated images to be only the 1000 imag"See full answer

    Data Scientist
    Machine Learning
  • TikTok logoAsked at TikTok 
    3 answers

    "I generate insights through stakeholder requirements and the data I have in hand"

    Anonymous Eagle - "I generate insights through stakeholder requirements and the data I have in hand"See full answer

    Data Scientist
    Analytical
    +1 more
  • McKinsey logoAsked at McKinsey 
    1 answer

    "The cases where data is under heavy outlier influence. Since mean fluctuates due to the presence of an outlier, median might be a better measure"

    Himani E. - "The cases where data is under heavy outlier influence. Since mean fluctuates due to the presence of an outlier, median might be a better measure"See full answer

    Data Scientist
    Statistics & Experimentation
  • DoorDash logoAsked at DoorDash 
    Add answer
    Data Scientist
    Data Analysis
  • Slack logoAsked at Slack 
    2 answers

    "The only time I felt unhappy at work was when I didn’t get to pick the job I want to do at the jobsite."

    Amparo L. - "The only time I felt unhappy at work was when I didn’t get to pick the job I want to do at the jobsite."See full answer

    Data Scientist
    Behavioral
  • 🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

  • "A Random Forest works by building an ensemble of decision trees, each trained on a slightly different version of the data. The key mechanism is bagging: for each tree, we sample the training data with replacement (bootstrapping), so every tree sees a different subset of examples. On top of that, at each split the algorithm randomly selects a subset of features, so trees explore different predictors. These two sources of randomness decorrelate the trees. When we aggregate them — by averag"

    Yuexiang Y. - "A Random Forest works by building an ensemble of decision trees, each trained on a slightly different version of the data. The key mechanism is bagging: for each tree, we sample the training data with replacement (bootstrapping), so every tree sees a different subset of examples. On top of that, at each split the algorithm randomly selects a subset of features, so trees explore different predictors. These two sources of randomness decorrelate the trees. When we aggregate them — by averag"See full answer

    Data Scientist
    Technical
  • "Probability that one of the coupons is used = 1 - Probability that no coupon is used = 1 - nC0 p^0 * (1-p)^n = 1 -(1-p)^n"

    Chetak C. - "Probability that one of the coupons is used = 1 - Probability that no coupon is used = 1 - nC0 p^0 * (1-p)^n = 1 -(1-p)^n"See full answer

    Data Scientist
    Statistics & Experimentation
  • Apple logoAsked at Apple 
    9 answers
    +5

    "Make current as root. 2 while current is not null, if p and q are less than current, go left. If p and q are greater than current, go right. else return current. return null"

    Vaibhav D. - "Make current as root. 2 while current is not null, if p and q are less than current, go left. If p and q are greater than current, go right. else return current. return null"See full answer

    Data Scientist
    Data Structures & Algorithms
    +4 more
  • Apple logoAsked at Apple 

    Coin Change

    IDE
    Medium
    12 answers
    +9

    "The example given is wrong. The 2nd test case should have answer 3, as we can get to 6 by using 3 coins of denomination 2."

    Anmol R. - "The example given is wrong. The 2nd test case should have answer 3, as we can get to 6 by using 3 coins of denomination 2."See full answer

    Data Scientist
    Coding
    +4 more
  • Salesforce logoAsked at Salesforce 
    Add answer
    Data Scientist
    Behavioral
    +4 more
  • 1 answer

    "The algorithm calculates certain metrics like entropy & Gini Impurity. The goal of the decision tree algorithm is to find the most optimal value for these metrics, lowest values for Gini Impurity & Entropy. Once it converges on the minima, it creates a split & grows the branches."

    Saurabh J. - "The algorithm calculates certain metrics like entropy & Gini Impurity. The goal of the decision tree algorithm is to find the most optimal value for these metrics, lowest values for Gini Impurity & Entropy. Once it converges on the minima, it creates a split & grows the branches."See full answer

    Data Scientist
    Concept
    +1 more
  • Tinder logoAsked at Tinder 
    Add answer
    Data Scientist
    Behavioral
  • Discord logoAsked at Discord 
    Add answer
    Data Scientist
    Behavioral
    +2 more
  • Adobe logoAsked at Adobe 
    1 answer

    "Leetcode 347: Heap + Hashtable Follow up question: create heap with the length of K instead of N (more time complexity but less space )"

    Chen J. - "Leetcode 347: Heap + Hashtable Follow up question: create heap with the length of K instead of N (more time complexity but less space )"See full answer

    Data Scientist
    Data Structures & Algorithms
    +3 more
  • Infosys logoAsked at Infosys 
    1 answer

    "In Python, an "oops" (Object-Oriented Programming) concept refers to a programming paradigm that is based on the idea of objects and classes. OOP allows developers to model real-world concepts and create reusable code blocks through the use of inheritance, polymorphism, and encapsulation. Here are some common OOP concepts in Python: Class: A class is a blueprint for creating objects. It defines the attributes and behaviors that objects of that class will have. Object: An object is an insta"

    Anonymous Flamingo - "In Python, an "oops" (Object-Oriented Programming) concept refers to a programming paradigm that is based on the idea of objects and classes. OOP allows developers to model real-world concepts and create reusable code blocks through the use of inheritance, polymorphism, and encapsulation. Here are some common OOP concepts in Python: Class: A class is a blueprint for creating objects. It defines the attributes and behaviors that objects of that class will have. Object: An object is an insta"See full answer

    Data Scientist
    Technical
  • Tinder logoAsked at Tinder 
    1 answer

    "I would recognize the factors that are causing the interference. Then i will use tools like smoothing techniques or algorithms (e.g Kalman filters for time series) which can help isolate genuine trends from noise. In testing i would employ techniqu es like A/B testing to measure interference from unrelated factors and use techniques like regression analysis to seperate the relevant factors from noise."

    Trusha M. - "I would recognize the factors that are causing the interference. Then i will use tools like smoothing techniques or algorithms (e.g Kalman filters for time series) which can help isolate genuine trends from noise. In testing i would employ techniqu es like A/B testing to measure interference from unrelated factors and use techniques like regression analysis to seperate the relevant factors from noise."See full answer

    Data Scientist
    Technical
  • Adobe logoAsked at Adobe 
    2 answers

    "#include #include #include using namespace std; void printComs(int prev, int start, int end, int target) { if (start >= end) return; while (start target) { end--; } else { st"

    Iris F. - "#include #include #include using namespace std; void printComs(int prev, int start, int end, int target) { if (start >= end) return; while (start target) { end--; } else { st"See full answer

    Data Scientist
    Data Structures & Algorithms
    +4 more
  • Oracle logoAsked at Oracle 
    2 answers

    "def countuniqueoutfits(totalpants: int, uniquepants: int, totalshirts: int, uniqueshirts: int, totalhats: int, uniquehats: int) -> int: """ Number of unique outfits can simply be defined by (uniquepantschoose1uniqueshirtschoose1uniquehatschoose_1) (uniquepantschoose1*uniqueshirtschoose1) # Not wearing a hat nchoosek is n """ res = (uniquepants*uniqueshirtsuniquehats) + (uniquepantsunique_shirts) return res print(countuniqueoutfits(2, 1, 1, 1, 3, 2))"

    Sai R. - "def countuniqueoutfits(totalpants: int, uniquepants: int, totalshirts: int, uniqueshirts: int, totalhats: int, uniquehats: int) -> int: """ Number of unique outfits can simply be defined by (uniquepantschoose1uniqueshirtschoose1uniquehatschoose_1) (uniquepantschoose1*uniqueshirtschoose1) # Not wearing a hat nchoosek is n """ res = (uniquepants*uniqueshirtsuniquehats) + (uniquepantsunique_shirts) return res print(countuniqueoutfits(2, 1, 1, 1, 3, 2))"See full answer

    Data Scientist
    Coding
  • DoorDash logoAsked at DoorDash 
    Add answer
    Data Scientist
    Data Analysis
    +3 more
  • Figma logoAsked at Figma 
    Add answer
    Data Scientist
    Behavioral
Showing 141-160 of 174