Skip to main content

Data Scientist Interview Questions

Review this list of 174 Data Scientist interview questions and answers verified by hiring managers and candidates.
  • Adobe logoAsked at Adobe 
    53 answers
    +49

    "a. Sort the array elements. b. take two pointers at index 0 and index Len-1; c. if the sum at the two pointers is target; break and return the pair d. if the sum is smaller, then move left pointer by 1 e. else move right pointer by 1; run the logic till the target is met or right pointer crosses the left pointer."

    Komal S. - "a. Sort the array elements. b. take two pointers at index 0 and index Len-1; c. if the sum at the two pointers is target; break and return the pair d. if the sum is smaller, then move left pointer by 1 e. else move right pointer by 1; run the logic till the target is met or right pointer crosses the left pointer."See full answer

    Data Scientist
    Data Structures & Algorithms
    +5 more
  • Amazon logoAsked at Amazon 
    5 answers
    +2

    "Situation: COVID has impacted everyone's lives, especially small businesses. Earlier this year, during the second lockdown in Malaysia, it was estimated that 50%-70% of small businesses have closed. It got me thinking, beyond the existing training programmes, what can my company do to support small businesses? Task: So, I took the initiative to gather our Comms and Government Affairs team, to work together and explore how we can: 1) meaningfully demonstrate our company's commitment in"

    Judy W. - "Situation: COVID has impacted everyone's lives, especially small businesses. Earlier this year, during the second lockdown in Malaysia, it was estimated that 50%-70% of small businesses have closed. It got me thinking, beyond the existing training programmes, what can my company do to support small businesses? Task: So, I took the initiative to gather our Comms and Government Affairs team, to work together and explore how we can: 1) meaningfully demonstrate our company's commitment in"See full answer

    Data Scientist
    Behavioral
    +1 more
  • Amazon logoAsked at Amazon 
    6 answers

    "1) select avg(session) from table where session> 180 2) select round(sessiontime/300)*300 as sessionbin, count() as sessioncount from table group by round(sessiontime/300)300 order by session_bin 3) SELECT t1.country AS country_a, t2.country AS country_b FROM ( SELECT country, COUNT(*) AS session_count FROM yourtablename GROUP BY country ) AS t1 JOIN ( SELECT country, COUNT(*) AS session_count FROM yourtablename `GROUP BY countr"

    Erjan G. - "1) select avg(session) from table where session> 180 2) select round(sessiontime/300)*300 as sessionbin, count() as sessioncount from table group by round(sessiontime/300)300 order by session_bin 3) SELECT t1.country AS country_a, t2.country AS country_b FROM ( SELECT country, COUNT(*) AS session_count FROM yourtablename GROUP BY country ) AS t1 JOIN ( SELECT country, COUNT(*) AS session_count FROM yourtablename `GROUP BY countr"See full answer

    Data Scientist
    Coding
    +4 more
  • Adobe logoAsked at Adobe 
    18 answers
    Video answer for 'Given an nxn grid of 1s and 0s, return the number of islands in the input.'
    +15

    " from typing import List def getnumberof_islands(binaryMatrix: List[List[int]]) -> int: if not binaryMatrix: return 0 rows = len(binaryMatrix) cols = len(binaryMatrix[0]) islands = 0 for r in range(rows): for c in range(cols): if binaryMatrixr == 1: islands += 1 dfs(binaryMatrix, r, c) return islands def dfs(grid, r, c): if ( r = len(grid) "

    Rick E. - " from typing import List def getnumberof_islands(binaryMatrix: List[List[int]]) -> int: if not binaryMatrix: return 0 rows = len(binaryMatrix) cols = len(binaryMatrix[0]) islands = 0 for r in range(rows): for c in range(cols): if binaryMatrixr == 1: islands += 1 dfs(binaryMatrix, r, c) return islands def dfs(grid, r, c): if ( r = len(grid) "See full answer

    Data Scientist
    Data Structures & Algorithms
    +4 more
  • Adobe logoAsked at Adobe 
    Add answer
    Video answer for 'Find the median of two sorted arrays.'
    Data Scientist
    Data Structures & Algorithms
    +4 more
  • 🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

  • 14 answers
    +10

    " select user_id, b.marketing_channel from user_sessions a Left join attribution b on b.sessionid = a.sessionid group by 1,2 HAVING sum(purchasevalue)>100 and min(adclick_timestamp) `"

    G B. - " select user_id, b.marketing_channel from user_sessions a Left join attribution b on b.sessionid = a.sessionid group by 1,2 HAVING sum(purchasevalue)>100 and min(adclick_timestamp) `"See full answer

    Data Scientist
    Coding
    +3 more
  • Apple logoAsked at Apple 
    17 answers
    +12

    " class ListNode: def init(self, val=0, next=None): self.val = val self.next = next def has_cycle(head: ListNode) -> bool: pass # your code goes here if head is None: return False previousNodes = set() iter = head while iter: if iter.val in previousNodes: return True previousNodes.add(iter.val) iter = iter.next; return False debug your code below node1 = ListNode(1) node2 = ListNode(2) n"

    Cagdas A. - " class ListNode: def init(self, val=0, next=None): self.val = val self.next = next def has_cycle(head: ListNode) -> bool: pass # your code goes here if head is None: return False previousNodes = set() iter = head while iter: if iter.val in previousNodes: return True previousNodes.add(iter.val) iter = iter.next; return False debug your code below node1 = ListNode(1) node2 = ListNode(2) n"See full answer

    Data Scientist
    Data Structures & Algorithms
    +4 more
  • "clarify: so does the 5% drop a sudden drop or overtime in the one week does it broadly drop 5% or it dropped only in some regions or in some segments like new acqusition / frequent active customers? or does the 5% drop also happened last year same period? DAU = acqusition x activation x retention segment: I will first quickly do some EDA to find out problem, like calculate the DAU drop in new customer, tenured customer, between regions to find out is there any difference. then I will also look"

    Yuexiang Y. - "clarify: so does the 5% drop a sudden drop or overtime in the one week does it broadly drop 5% or it dropped only in some regions or in some segments like new acqusition / frequent active customers? or does the 5% drop also happened last year same period? DAU = acqusition x activation x retention segment: I will first quickly do some EDA to find out problem, like calculate the DAU drop in new customer, tenured customer, between regions to find out is there any difference. then I will also look"See full answer

    Data Scientist
    Analytical
  • Microsoft logoAsked at Microsoft 
    2 answers

    "In the Transformer architecture, the decoder differs from the encoder primarily in its additional mechanisms designed to handle autoregressive sequence generation. Here's a breakdown of the key differences: Self-Attention Mechanism: Encoder: The encoder has a standard self-attention mechanism that allows each token to attend to all other tokens in the input sequence. Decoder: The decoder has two types of self-attention. The first is the same as in the encoder, but the second is mas"

    Ranj A. - "In the Transformer architecture, the decoder differs from the encoder primarily in its additional mechanisms designed to handle autoregressive sequence generation. Here's a breakdown of the key differences: Self-Attention Mechanism: Encoder: The encoder has a standard self-attention mechanism that allows each token to attend to all other tokens in the input sequence. Decoder: The decoder has two types of self-attention. The first is the same as in the encoder, but the second is mas"See full answer

    Data Scientist
    Statistics & Experimentation
  • CVS logoAsked at CVS 
    1 answer

    "Situation (S) I was leading the greenfield development of an AI-driven Supply Chain Control Tower for a major medical equipment manufacturer. The business priority was "First Mover Advantage," requiring a Beta release for three pilot hospitals within a compressed 5-month timeline. To hit this window, we initially chose a Low-Code (Power Pages) architecture for its auto-scaling and rapid deployment capabilities. Task (T) Post-Beta, the Marketing team rejected the UI/"

    Apurv M. - "Situation (S) I was leading the greenfield development of an AI-driven Supply Chain Control Tower for a major medical equipment manufacturer. The business priority was "First Mover Advantage," requiring a Beta release for three pilot hospitals within a compressed 5-month timeline. To hit this window, we initially chose a Low-Code (Power Pages) architecture for its auto-scaling and rapid deployment capabilities. Task (T) Post-Beta, the Marketing team rejected the UI/"See full answer

    Data Scientist
    Behavioral
  • Adobe logoAsked at Adobe 
    12 answers
    +8

    " function findPrimes(n) { if (n 1; i--) { if (num % i === 0) { notPrimes.add(num); return false; } } return true; } for (let i = 2; i 5 && !notPr"

    Jeff S. - " function findPrimes(n) { if (n 1; i--) { if (num % i === 0) { notPrimes.add(num); return false; } } return true; } for (let i = 2; i 5 && !notPr"See full answer

    Data Scientist
    Data Structures & Algorithms
    +4 more
  • Google logoAsked at Google 
    4 answers
    +1

    "The distribution of daily search queries per user, as shown in the histogram, can be described as approximately normal (or bell-shaped) with a slight positive skew. Key Characteristics: Shape: The distribution is roughly symmetrical around its center, resembling a bell curve. This indicates that most users perform a moderate number of daily search queries. Central Tendency: The peak of the distribution, representing the highest density of users, appears to be around **8"

    Sam A. - "The distribution of daily search queries per user, as shown in the histogram, can be described as approximately normal (or bell-shaped) with a slight positive skew. Key Characteristics: Shape: The distribution is roughly symmetrical around its center, resembling a bell curve. This indicates that most users perform a moderate number of daily search queries. Central Tendency: The peak of the distribution, representing the highest density of users, appears to be around **8"See full answer

    Data Scientist
    Statistics & Experimentation
  • Airbnb logoAsked at Airbnb 
    Add answer
    Data Scientist
    Data Structures & Algorithms
    +4 more
  • "If you effectively listen and understand their point of view, then take action to address the issue quickly. Don't let too much time slip between the conflict and the resolution. If resolving the concern will take more time, communicate the current status and next steps with the stakeholder."

    Abdurhman M. - "If you effectively listen and understand their point of view, then take action to address the issue quickly. Don't let too much time slip between the conflict and the resolution. If resolving the concern will take more time, communicate the current status and next steps with the stakeholder."See full answer

    Data Scientist
    Behavioral
    +1 more
  • 15 answers
    +12

    " with youngsuccrate as( select strftime('%m', postdate) AS postmonth, round(sum(issuccessfulpost)*1.0/count(issuccessfulpost),2)as yascrate from post where userid in (select userid from post_user where age between 0 and 18) group by post_month ), nonyoungsucc_rate as( select strftime('%m', postdate) AS postmonth, round(sum(issuccessfulpost)*1.0/count(issuccessfulpost),2)as nonyasc_rate from post where user_id in (select"

    Bhavna S. - " with youngsuccrate as( select strftime('%m', postdate) AS postmonth, round(sum(issuccessfulpost)*1.0/count(issuccessfulpost),2)as yascrate from post where userid in (select userid from post_user where age between 0 and 18) group by post_month ), nonyoungsucc_rate as( select strftime('%m', postdate) AS postmonth, round(sum(issuccessfulpost)*1.0/count(issuccessfulpost),2)as nonyasc_rate from post where user_id in (select"See full answer

    Data Scientist
    Coding
    +3 more
  • Stripe logoAsked at Stripe 
    Add answer
    Video answer for 'How can Stripe use data to predict the optimal time to retry a transaction?'
    Data Scientist
    Analytical
    +1 more
  • OpenAI logoAsked at OpenAI 
    Add answer
    Data Scientist
    Behavioral
    +5 more
  • "Goals : Determine if the TV series should be renewed If it should be renewed, how much should Netflix be willing to pay for this series Let's assume that the goal is to maximize subscriber retention and engagement while paying a reasonable amount for the licensing costs that is justified by the value added by the series. Assumptions : The show is exclusive to Netflix for a particular region (for eg. US) It has been on the platform for an year Netflix has subscriber level data around"

    Saurabh K. - "Goals : Determine if the TV series should be renewed If it should be renewed, how much should Netflix be willing to pay for this series Let's assume that the goal is to maximize subscriber retention and engagement while paying a reasonable amount for the licensing costs that is justified by the value added by the series. Assumptions : The show is exclusive to Netflix for a particular region (for eg. US) It has been on the platform for an year Netflix has subscriber level data around"See full answer

    Data Scientist
    Data Analysis
Showing 61-80 of 174