Home Questions Data Scientist Statistics & Experimentation

Data Scientist Statistics & Experimentation Interview Questions

Review this list of 11 statistics & experimentation data scientist interview questions and answers verified by hiring managers and candidates.

+ Add interview

Product

Engineering

Operations

Design

Marketing

Data

Sales

Finance

Consulting

Security

Add interview

Data Scientist Product Analyst Machine Learning Engineer Data Engineer Management Consultant Product Manager Software Engineer Business Analyst

Asked at Meta (Facebook) • 4 months ago
You're designing an A/B test to evaluate the impact of showing content from non-friends in users' feeds. How would you test this with proper randomization?
Data Scientist
Statistics & Experimentation
1 answer I was asked this
"Before proceeding, I just wanted to clarify we wanted to check for the impact of showing content from non-friends in users’ feeds, and here non-friends I would assume could be anyone, but mainly like content creators, and I am not including ads here. But I wanted to ask if there is any current logic as to what posts to show based on users' affinity to those posts, maybe basis the user engagement to Insta feed. now objective of this would be to improve the engagement of the platform, as if users"
Dhruv S. - "Before proceeding, I just wanted to clarify we wanted to check for the impact of showing content from non-friends in users’ feeds, and here non-friends I would assume could be anyone, but mainly like content creators, and I am not including ads here. But I wanted to ask if there is any current logic as to what posts to show based on users' affinity to those posts, maybe basis the user engagement to Insta feed. now objective of this would be to improve the engagement of the platform, as if users"See full answer
Data Scientist
Statistics & Experimentation
Asked at Google • 2 months ago
A PM at Google asked you to describe the distribution of daily search queries per user. How would you describe it?
Data Scientist
Statistics & Experimentation
2 answers I was asked this
"The distribution of daily search queries per user, as shown in the histogram, can be described as approximately normal (or bell-shaped) with a slight positive skew. Key Characteristics: Shape: The distribution is roughly symmetrical around its center, resembling a bell curve. This indicates that most users perform a moderate number of daily search queries. Central Tendency: The peak of the distribution, representing the highest density of users, appears to be around **8"
Sam A. - "The distribution of daily search queries per user, as shown in the histogram, can be described as approximately normal (or bell-shaped) with a slight positive skew. Key Characteristics: Shape: The distribution is roughly symmetrical around its center, resembling a bell curve. This indicates that most users perform a moderate number of daily search queries. Central Tendency: The peak of the distribution, representing the highest density of users, appears to be around **8"See full answer
Data Scientist
Statistics & Experimentation
Asked at Lyft • a year ago
A $5 discount coupon is given to N riders. The probability of using a coupon is P. What is the expected cost for the company?
Data Scientist
Statistics & Experimentation
3 answers I was asked this
"Is there a reason a confidence interval was used to solve this problem over just using the mean/expected value directly?"
Aarav G. - "Is there a reason a confidence interval was used to solve this problem over just using the mean/expected value directly?"See full answer
Data Scientist
Statistics & Experimentation
Asked at Microsoft • 9 months ago
In the transformer architecture, what makes the decoder different from the encoder?
Data Scientist
Statistics & Experimentation
2 answers I was asked this
"In the Transformer architecture, the decoder differs from the encoder primarily in its additional mechanisms designed to handle autoregressive sequence generation. Here's a breakdown of the key differences: Self-Attention Mechanism: Encoder: The encoder has a standard self-attention mechanism that allows each token to attend to all other tokens in the input sequence. Decoder: The decoder has two types of self-attention. The first is the same as in the encoder, but the second is mas"
Ranj A. - "In the Transformer architecture, the decoder differs from the encoder primarily in its additional mechanisms designed to handle autoregressive sequence generation. Here's a breakdown of the key differences: Self-Attention Mechanism: Encoder: The encoder has a standard self-attention mechanism that allows each token to attend to all other tokens in the input sequence. Decoder: The decoder has two types of self-attention. The first is the same as in the encoder, but the second is mas"See full answer
Data Scientist
Statistics & Experimentation
Asked at DoorDash • 3 months ago
You're a PM at a food delivery app where conversion rates have declined over the past week. How would you investigate the causes? (Conversion: From users browsing to placing orders.)
Data Scientist
Statistics & Experimentation
+2 more
Add answer I was asked this
Data Scientist
Statistics & Experimentation
+2 more

🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

Asked at Meta (Facebook), Goldman Sachs, LinkedIn • 9 months ago
Explain Bayes' theorem.
Data Scientist
Statistics & Experimentation
+2 more
3 answers I was asked this
"Is it bad to get the answer a different way? Will they mark that as not knowing Bayes Theorem or just correct as it is an easier way to get the answer? The way I went is to look at what happens when the factory makes 100 light bulbs. Machine A makes 60 of which 3 are faulty, Machine B makes 40 of which 1.2 are faulty. Therefore the pool of faulty lightbulbs is 3/4.2 = 5/7 from machine A and 1.2/4.2 = 3/7 from Machine B."
Will I. - "Is it bad to get the answer a different way? Will they mark that as not knowing Bayes Theorem or just correct as it is an easier way to get the answer? The way I went is to look at what happens when the factory makes 100 light bulbs. Machine A makes 60 of which 3 are faulty, Machine B makes 40 of which 1.2 are faulty. Therefore the pool of faulty lightbulbs is 3/4.2 = 5/7 from machine A and 1.2/4.2 = 3/7 from Machine B."See full answer
Data Scientist
Statistics & Experimentation
+2 more
Asked at Robinhood • 8 months ago
Robinhood is planning to introduce a new feature which allows users to trade fractional shares. How would you decide whether this is a good idea or not?
Data Scientist
Statistics & Experimentation
1 answer I was asked this
"I would use A/B testing to see if the new feature would be incrementally beneficial. To begin the testing, we should define what's the goal of this testing. Let's say the new feature would increase the average number of trade by X. Then randomly assign the clients to two groups, control and test group. Control group doesn't see the new feature and the test group see the new feature. We could also stratified sampling if we want to make sure cover different customer segmentation. During this desig"
Jiin S. - "I would use A/B testing to see if the new feature would be incrementally beneficial. To begin the testing, we should define what's the goal of this testing. Let's say the new feature would increase the average number of trade by X. Then randomly assign the clients to two groups, control and test group. Control group doesn't see the new feature and the test group see the new feature. We could also stratified sampling if we want to make sure cover different customer segmentation. During this desig"See full answer
Data Scientist
Statistics & Experimentation
Asked at Amazon • 2 months ago
Hypothesis Testing: Suppose a PM claims that users, on average, spend about $50 per month on Amazon. However, you doubt this claim and believe the average should be higher. You sample 100 users and...
Data Scientist
Statistics & Experimentation
1 answer I was asked this
"I would conduct a sample z-test because we have enough samples and the population variance is known. H1: average monthly spending per user is $50 H0: average monthly spending per user is greater $50 One-sample z-test x_bar = $85 mu = $50 s = $20 n = 100 x_bar - mu / (s / sqrt(n) = 17.5 17.5 is the z-score that we will need to associate with its corresponding p-value. However, the z-score is very high, so the p-value will be very close to zero, which is much less than the standa"
Lucas G. - "I would conduct a sample z-test because we have enough samples and the population variance is known. H1: average monthly spending per user is $50 H0: average monthly spending per user is greater $50 One-sample z-test x_bar = $85 mu = $50 s = $20 n = 100 x_bar - mu / (s / sqrt(n) = 17.5 17.5 is the z-score that we will need to associate with its corresponding p-value. However, the z-score is very high, so the p-value will be very close to zero, which is much less than the standa"See full answer
Data Scientist
Statistics & Experimentation
Asked at Meta (Facebook) • 2 months ago
A PM at Meta asked you to describe the distribution of daily minutes spent on Facebook per user. How would you describe it?
Data Scientist
Statistics & Experimentation
Add answer I was asked this
Data Scientist
Statistics & Experimentation
Asked at McKinsey • 6 months ago
In what cases should you use the median instead of the mean?
Data Scientist
Statistics & Experimentation
1 answer I was asked this
"The cases where data is under heavy outlier influence. Since mean fluctuates due to the presence of an outlier, median might be a better measure"
Himani E. - "The cases where data is under heavy outlier influence. Since mean fluctuates due to the presence of an outlier, median might be a better measure"See full answer
Data Scientist
Statistics & Experimentation
Asked at Lyft • a year ago
A discount coupon is given to N riders. The probability of using a coupon is P. What is the probability that one of the coupons will be used?
Data Scientist
Statistics & Experimentation
2 answers I was asked this
"Probability that a coupon is used = P Probability that a coupon is not used = 1-P Probability that none of the N coupons are used = (1-P)^N Probability that at least one of the N coupons are used = 1 - (1-P)^N"
Saurabh K. - "Probability that a coupon is used = P Probability that a coupon is not used = 1-P Probability that none of the N coupons are used = (1-P)^N Probability that at least one of the N coupons are used = 1 - (1-P)^N"See full answer
Data Scientist
Statistics & Experimentation

Showing 1-11 of 11

Interviewed recently?

Help improve our question database (and earn karma) by telling us about your experience

Trending companies

Data Scientist Statistics & Experimentation Interview Questions

You're designing an A/B test to evaluate the impact of showing content from non-friends in users' feeds. How would you test this with proper randomization?

A PM at Google asked you to describe the distribution of daily search queries per user. How would you describe it?

A $5 discount coupon is given to N riders. The probability of using a coupon is P. What is the expected cost for the company?

In the transformer architecture, what makes the decoder different from the encoder?

You're a PM at a food delivery app where conversion rates have declined over the past week. How would you investigate the causes? (Conversion: From users browsing to placing orders.)

Explain Bayes' theorem.

Robinhood is planning to introduce a new feature which allows users to trade fractional shares. How would you decide whether this is a good idea or not?

Hypothesis Testing: Suppose a PM claims that users, on average, spend about $50 per month on Amazon. However, you doubt this claim and believe the average should be higher. You sample 100 users and...

A PM at Meta asked you to describe the distribution of daily minutes spent on Facebook per user. How would you describe it?

In what cases should you use the median instead of the mean?

A discount coupon is given to N riders. The probability of using a coupon is P. What is the probability that one of the coupons will be used?

Explore questions by company

Explore questions by role