Data Scientist Interview Questions

Review this list of 174 Data Scientist interview questions and answers verified by hiring managers and candidates.


Asked at Adobe, Apple, Intuit + 3 more • a year ago
Sudoku Solver
IDE
Hard
Data Scientist
Data Structures & Algorithms
+4 more
4 answers
Divya R. - "static boolean sudokuSolve(char[][] board) { return sudokuSolve(board, 0, 0); } static boolean sudokuSolve(char[][] board, int r, int c) { if(c>=board[0].length) { r=r+1; c=0; } if(r>=board.length) return true; if(board[r][c]=='.') { for(int num=1; num<=9; num++) { board[r][c]=(char)('0' + num); if(isValidPosition(board, r, c)) { if(sudokuSolve(board, r, c+1)) return true; } board[r][c]='.'; } } else { return sudokuSolve(board, r, c+1); } return false; } static boolean isValidPosition(char b…"
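The answer preview above is cut off before its validity check, so the following is only a minimal Python sketch of the same backtracking idea, not a reconstruction of the original answer. It assumes a 9x9 board stored as a list of lists of single characters with '.' for empty cells, and that the given clues do not already conflict. The names `is_valid` and `sudoku_solve` are illustrative.

```python
def is_valid(board, r, c, ch):
    """Return True if digit ch can go at (r, c) without a row, column or box clash."""
    for i in range(9):
        if board[r][i] == ch or board[i][c] == ch:
            return False
    box_r, box_c = 3 * (r // 3), 3 * (c // 3)  # top-left corner of the 3x3 box
    for i in range(box_r, box_r + 3):
        for j in range(box_c, box_c + 3):
            if board[i][j] == ch:
                return False
    return True


def sudoku_solve(board, r=0, c=0):
    """Fill the '.' cells in place, scanning row by row; return True if solved."""
    if c == 9:               # past the end of a row: move to the next row
        r, c = r + 1, 0
    if r == 9:               # past the last row: every cell is filled
        return True
    if board[r][c] != '.':   # pre-filled clue: skip it
        return sudoku_solve(board, r, c + 1)
    for ch in "123456789":
        if is_valid(board, r, c, ch):
            board[r][c] = ch
            if sudoku_solve(board, r, c + 1):
                return True
            board[r][c] = '.'  # undo the choice and try the next digit
    return False
```

Checking validity before writing the digit avoids having to exclude the current cell from the comparison, which is a small departure from the place-then-check order in the quoted Java.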
Asked at TikTok, Valve • 3 years ago
As the data scientist, interpreting a significant increase in revenue from a new feature in one of 20 countries, what would you recommend?
Data Scientist
Analytical
2 answers
Brook - "Too much discussion of p-values and theoretical things… the countries are independent…"
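One reason to be cautious about a single significant country out of 20 is multiple comparisons. The short calculation below is a sketch of that arithmetic. It assumes a per-country significance level of 0.05 (not stated in the question) and independent tests across countries, as the answer above suggests.

```python
alpha = 0.05       # assumed per-country significance level
n_countries = 20   # from the question

# Probability that at least one of 20 independent tests is "significant"
# when the feature has no real effect anywhere.
family_wise_error = 1 - (1 - alpha) ** n_countries

# Bonferroni correction: a stricter per-test threshold that keeps the
# overall false-positive rate at or below alpha.
bonferroni_alpha = alpha / n_countries

print(f"P(at least one false positive) = {family_wise_error:.3f}")
print(f"Bonferroni per-test threshold  = {bonferroni_alpha:.4f}")
```

Under these assumptions the chance of at least one false positive is roughly 64%, so a reasonable recommendation is to check the result against a corrected threshold or re-run the test in that country before acting on it.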
Fraudulent Transactions
IDE
Medium
Data Scientist
Coding
+3 more
6 answers
Jayveer S. - "WITH suspicious_transactions AS ( SELECT c.first_name, c.last_name, t.receipt_number, COUNT(t.receipt_number) OVER (PARTITION BY c.customer_id) AS no_of_offences FROM customers c JOIN transactions t ON c.customer_id = t.customer_id WHERE t.receipt_number LIKE '%999%' OR t.receipt_number LIKE '%1234%' OR t.receipt_number LIKE '%XYZ%' ) SELECT first_name, last_name, receipt_number, no_of_offences FROM suspicious_transactions WHERE no_of_offences >= 2;"
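To see how the query above behaves, it can be run against a small in-memory SQLite database from Python. This is a sketch only: the table layouts, the underscore column names (`customer_id`, `receipt_number`) and every sample row are assumptions made up for illustration, and window functions need SQLite 3.25 or newer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, first_name TEXT, last_name TEXT);
CREATE TABLE transactions (receipt_number TEXT, customer_id INTEGER);
-- Made-up sample data: customer 1 has two suspicious receipts, customer 2 has one.
INSERT INTO customers VALUES (1, 'Ada', 'Lovelace'), (2, 'Alan', 'Turing');
INSERT INTO transactions VALUES
  ('A999B', 1), ('C1234D', 1), ('E5555F', 1),
  ('XYZ001', 2), ('G0000H', 2);
""")

QUERY = """
WITH suspicious_transactions AS (
    SELECT c.first_name, c.last_name, t.receipt_number,
           COUNT(t.receipt_number) OVER (PARTITION BY c.customer_id) AS no_of_offences
    FROM customers c
    JOIN transactions t ON c.customer_id = t.customer_id
    WHERE t.receipt_number LIKE '%999%'
       OR t.receipt_number LIKE '%1234%'
       OR t.receipt_number LIKE '%XYZ%'
)
SELECT first_name, last_name, receipt_number, no_of_offences
FROM suspicious_transactions
WHERE no_of_offences >= 2;
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

Because the WHERE filter runs before the window function, the count covers only suspicious receipts, so a customer with one flagged receipt and many clean ones is not reported.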
Asked at Meta • a year ago
How can you improve Facebook’s DAU?
Data Scientist
Analytical
Asked at OpenAI • 5 months ago
Metrics moved in different directions. How do you interpret the results and decide next steps?
Data Scientist
Statistics & Experimentation

Asked at Adobe, Apple, Google + 1 more • a year ago
Permutations
IDE
Medium
Data Scientist
Data Structures & Algorithms
+3 more
4 answers
Tiago R. - "function permute(nums) { if (nums.length <= 1) { return [nums]; } const prevPermutations = permute(nums.slice(0, nums.length-1)); const currentNum = nums[nums.length-1]; const permutations = new Set(); for (let prev of prevPermutations) { for (let i=0; i < prev.length; i++) { permutations.add([...prev.slice(0, i), currentNum, ...prev.slice(i)]); } permutations.add([...prev, currentNum]); } return [...permutations]…"
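The insertion approach in the answer above can be sketched in Python as follows. This is an illustrative version, assuming the input elements are distinct. It collects results in a plain list, since a JavaScript Set compares arrays by identity and so does not remove duplicate permutations anyway.

```python
def permute(nums):
    """Return all permutations of nums by inserting the last element
    into every position of each permutation of the remaining elements."""
    if len(nums) <= 1:
        return [list(nums)]
    *rest, last = nums
    result = []
    for prev in permute(rest):
        # len(prev) + 1 insertion points, including the end of the list.
        for i in range(len(prev) + 1):
            result.append(prev[:i] + [last] + prev[i:])
    return result
```

For n distinct elements this produces n! lists, so it is only practical for small inputs.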
Asked at Discord, Two Sigma • 19 days ago
What other companies are you interviewing at and why?
Data Scientist
Behavioral
+4 more
Asked at Cognition AI, Figma, Traba • 5 months ago
What motivates you?
Data Scientist
Behavioral
+3 more
Asked at SAP • 3 years ago
Design a system capable of identifying ships that deviate from their course using a dataset that tracks ship positions, recorded as tuples containing (ship_ID, x, y, z, timestamp), with irregular t...
Data Scientist
System Design
1 answer
Anonymous Hornet - "To handle the non-uniform sampling, I'd first clean the dataset and divide it into chunks of 'uniform' trajectory data at n-second intervals (e.g. 5 s or 10 s trajectories). This gives cleaner trajectory chunks, T, in the format (ship_ID, x, y, z, timestamp). For the system itself, I'd use a generative model, e.g. a Variational AutoEncoder (VAE), and train the model's 'encoder' to produce a latent-space representation of the input features (x, y, z, timestamp) from T, and its 'decoder' to pred…"
Asked at OpenAI • 5 months ago
How do you design an experiment to avoid common pitfalls in interpreting results?
Data Scientist
Statistics & Experimentation
Asked at Meta • 5 years ago
How would you help the Instagram team decide whether to launch the Rooms feature after a successful launch on Facebook?
Data Scientist
Asked at Tinder • 2 years ago
Tinder subscriptions renew monthly. Explain why different months may have different numbers of renewals.
Data Scientist
Technical
1 answer
Srijita P. - "Clarification question: How many subscription plans does Tinder offer? If there is more than one plan, we need to ask whether the fluctuation is happening across all plans or in a particular one. Assumption: Let's say the lower-priced plan shows the most fluctuation and there are only two types of plans. Within this plan, which age group shows the most fluctuation (18-24, 25-30, 30+, etc.)? Is there any seasonality trend observed (e.g. placemen…"
Asked at Microsoft • 2 years ago
Given a list of numbers, find the median without sorting the entire list. Hint: use the partitioning step of quicksort (quickselect).
Data Scientist
Coding
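The usual way to act on this hint is quickselect, which reuses quicksort's partitioning but only recurses into the side that contains the target rank. The sketch below is one possible version rather than a reference answer. It assumes numeric input, picks pivots at random, and averages the two middle values for even-length lists. The function names are illustrative.

```python
import random


def quickselect(values, k):
    """Return the k-th smallest element (0-based) without fully sorting."""
    items = list(values)
    while True:
        pivot = random.choice(items)
        lows = [x for x in items if x < pivot]
        pivots = [x for x in items if x == pivot]
        if k < len(lows):
            items = lows                      # target is among the smaller values
        elif k < len(lows) + len(pivots):
            return pivot                      # target equals the pivot
        else:
            k -= len(lows) + len(pivots)      # skip everything <= pivot
            items = [x for x in items if x > pivot]


def median(values):
    """Median via quickselect; averages the two middle values if n is even."""
    n = len(values)
    if n == 0:
        raise ValueError("median of an empty list is undefined")
    if n % 2 == 1:
        return quickselect(values, n // 2)
    return (quickselect(values, n // 2 - 1) + quickselect(values, n // 2)) / 2
```

With random pivots the expected running time is linear in the list length, although the worst case is still quadratic.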
Asked at Walmart Labs • a year ago
Why do you want to work at Walmart Labs?
Data Scientist
Behavioral
+5 more
Asked at Meta • a year ago
A user advocacy group raises concerns about accessibility for individuals with hearing disabilities. What are some product improvements for Facebook Live and Videos, and how would you define succes...
Data Scientist
Execution
Asked at Amazon • a year ago
Hypothesis Testing: Suppose a PM claims that users, on average, spend about $50 per month on Amazon. However, you doubt this claim and believe the average should be higher. You sample 100 users and...
Data Scientist
Statistics & Experimentation
1 answer
Lucas G. - "I would conduct a one-sample z-test because we have enough samples and the population variance is known. H0: average monthly spending per user is $50. H1: average monthly spending per user is greater than $50. One-sample z-test: x_bar = $85, mu = $50, s = $20, n = 100. z = (x_bar - mu) / (s / sqrt(n)) = 17.5. 17.5 is the z-score that we need to associate with its corresponding p-value. However, the z-score is very high, so the p-value will be very close to zero, which is much less than the standa…"
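The arithmetic in the answer above can be checked with a few lines of Python. This is a sketch that takes the numbers quoted in the answer (x_bar = 85, mu = 50, s = 20, n = 100) at face value, since the question text itself is truncated. It computes the upper-tail p-value from the standard normal distribution using only the standard library, and `one_sample_z` is an illustrative name.

```python
import math


def one_sample_z(x_bar, mu0, s, n):
    """Return (z, upper-tail p-value) for H0: mu = mu0 versus H1: mu > mu0."""
    z = (x_bar - mu0) / (s / math.sqrt(n))
    # P(Z >= z) for a standard normal, written with the complementary error function.
    p_value = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_value


z, p = one_sample_z(x_bar=85, mu0=50, s=20, n=100)
print(f"z = {z:.1f}, p = {p:.3g}")
```

The standard error is 20 / sqrt(100) = 2, so z = 35 / 2 = 17.5, and the one-sided p-value is far below any conventional significance level.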
Asked at Meta • 5 years ago
Would you port Facebook rooms to Instagram?
Data Scientist
Product Strategy
Asked at AstraZeneca, Nvidia • a month ago
Tell me about your experience working with scientists.
Data Scientist
Behavioral
+1 more
1 answer
Mark M. - "I don't have experience working with a lot of biological scientists. Most of my experience comes from working with data scientists. I described how I used ideation techniques like brainstorming and other creative ways to get people to find common ground. I also mentioned how I like to run surveys before meetings to prompt people and to get unbiased opinions."
Asked at McKinsey • a year ago
One of your clients has experienced a 20% decline in profits. What would you do?
Data Scientist
Analytical
+1 more
1 answer
Ruth A. - "Spoiled food In a process I improved, I streamlined how tasks were assigned to reduce delays and confusion."
Asked at McKinsey • a year ago
In what cases should you use the median instead of the mean?
Data Scientist
Statistics & Experimentation
1 answer
Himani E. - "Cases where the data is heavily influenced by outliers. Since the mean shifts in the presence of an outlier, the median may be a better measure of the typical value."
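A small numeric example makes the point in the answer above concrete. The values below are made up for illustration and use Python's standard `statistics` module.

```python
import statistics

# Made-up sample values, e.g. incomes in thousands.
incomes = [40, 42, 45, 47, 50]
with_outlier = incomes + [1000]   # add one extreme value

mean_before = statistics.mean(incomes)
median_before = statistics.median(incomes)
mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print(mean_before, median_before)   # mean and median are close without the outlier
print(mean_after, median_after)     # the mean jumps, the median barely moves
```

A single extreme value moves the mean from 44.8 to 204 while the median only shifts from 45 to 46, which is why the median is often preferred for skewed data such as incomes or house prices.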

Showing 121-140 of 174
