Data Analysis Interview Questions

Review this list of 101 data analysis interview questions and answers verified by hiring managers and candidates.

+ Add interview

Product

Engineering

Operations

Design

Marketing

Data

Sales

Finance

Consulting

Add interview

Data Analyst Product Analyst Business Analyst Data Scientist BizOps & Strategy Product Manager

Asked at Amazon • 9 months ago
How would you visualize sales and customer usage?
Business Analyst
Data Analysis
+2 more
1 answer I was asked this
"We want sales to grow, in order to have a growth in revenue. And customer usage as well as it allows to see if our product lead more engagement from our users. So to be able to see this overall evolution I would make a line chart for both : Sales : with month on x-axis and sales revenue on y-axis Customer Usage : with month on x-axis and a KPI allowing to measure customer usage (nblogins or nbsessions or nbgamesplayed, ... depending on the industry) on y-axis Moreover, after knowing th"
Catherine T. - "We want sales to grow, in order to have a growth in revenue. And customer usage as well as it allows to see if our product lead more engagement from our users. So to be able to see this overall evolution I would make a line chart for both : Sales : with month on x-axis and sales revenue on y-axis Customer Usage : with month on x-axis and a KPI allowing to measure customer usage (nblogins or nbsessions or nbgamesplayed, ... depending on the industry) on y-axis Moreover, after knowing th"See full answer
Business Analyst
Data Analysis
+2 more
You’re launching a new feature—how would you measure its strategic impact on growth?
Data Analyst
Data Analysis
+2 more
1 answer I was asked this
"First, I would start by defining what growth means in the context of this new feature whether it's user acquisition, engagement, retention, or revenue. Next, I’d identify clear KPIs that directly align with that growth goal. For example, if the feature aims to improve engagement, I’d track metrics like daily active users, session duration, or feature adoption rate. Once the KPIs are in place, I’d run an A/B test comparing user behavior with and without the feature. This would be followed by de"
Himanshu G. - "First, I would start by defining what growth means in the context of this new feature whether it's user acquisition, engagement, retention, or revenue. Next, I’d identify clear KPIs that directly align with that growth goal. For example, if the feature aims to improve engagement, I’d track metrics like daily active users, session duration, or feature adoption rate. Once the KPIs are in place, I’d run an A/B test comparing user behavior with and without the feature. This would be followed by de"See full answer
Data Analyst
Data Analysis
+2 more
Tell me about a time when the business problem wasn't clearly defined—how did you handle it?
Data Analyst
Data Analysis
+3 more
1 answer I was asked this
"Investigation clear understanding the true cause"
Akash D. - "Investigation clear understanding the true cause"See full answer
Data Analyst
Data Analysis
+3 more
Asked at Stripe • 4 years ago
How can Stripe use data to predict the optimal time to retry a transaction?
Data Scientist
Data Analysis
+1 more
Add answer I was asked this
Data Scientist
Data Analysis
+1 more
How do you approach an ambiguous request from a stakeholder?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more

🧠 Want an expert answer to a question? Saving questions lets us know what content to make next.

Unsold Products
IDE
Easy
Data Scientist
Data Analysis
+2 more
8 answers I was asked this
+4
" import pandas as pd def findunsoldproducts(transactions: pd.DataFrame, products: pd.DataFrame) -> pd.DataFrame: Extract purchased product IDs purchasedproductids = transactions['product_id'].unique() Filter products that have never been purchased unsoldproducts = products[~products['id'].isin(purchasedproduct_ids)] Select the desired columns result = unsold_products[['id', 'name', 'stock']] Sort the result by product ID in ascending order"
Gowtham B. - " import pandas as pd def findunsoldproducts(transactions: pd.DataFrame, products: pd.DataFrame) -> pd.DataFrame: Extract purchased product IDs purchasedproductids = transactions['product_id'].unique() Filter products that have never been purchased unsoldproducts = products[~products['id'].isin(purchasedproduct_ids)] Select the desired columns result = unsold_products[['id', 'name', 'stock']] Sort the result by product ID in ascending order"See full answer
Data Scientist
Data Analysis
+2 more
You have three ideas for expansion but limited resources. How would you help prioritize them?
Data Analyst
Data Analysis
+2 more
1 answer I was asked this
"First, I’d start by checking the alignment of each idea with our core business goals. If any idea doesn't directly contribute to those goals, I’d deprioritize or eliminate it upfront. Next, I’d use a scoring model like RICE (Reach, Impact, Confidence, Effort), especially because effort is a critical factor when resources are limited. This gives us a structured and quantifiable way to rank the ideas. Once we have a prioritized list based on scores, I’d take it a step further and evaluate key as"
Himanshu G. - "First, I’d start by checking the alignment of each idea with our core business goals. If any idea doesn't directly contribute to those goals, I’d deprioritize or eliminate it upfront. Next, I’d use a scoring model like RICE (Reach, Impact, Confidence, Effort), especially because effort is a critical factor when resources are limited. This gives us a structured and quantifiable way to rank the ideas. Once we have a prioritized list based on scores, I’d take it a step further and evaluate key as"See full answer
Data Analyst
Data Analysis
+2 more
Asked at DoorDash • 4 months ago
Tell me about a time when you had a hypothesis that turned out to be wrong.
BizOps & Strategy
Data Analysis
+1 more
Add answer I was asked this
BizOps & Strategy
Data Analysis
+1 more
Revenue by Customer City
IDE
Medium
Data Scientist
Data Analysis
+2 more
6 answers I was asked this
+3
"Hi, my solution gives the exact numerical values as the proposed solution, but it doesn't pass the tests. Am I missing something, or is this a bug? def findrevenueby_city(transactions: pd.DataFrame, users: pd.DataFrame, exchange_rate: pd.DataFrame) -> pd.DataFrame: gets user city for each user id userids = users[['id', 'usercity']] and merge on transactions transactions = transactions.merge(user_ids, how='left"
Gabriel P. - "Hi, my solution gives the exact numerical values as the proposed solution, but it doesn't pass the tests. Am I missing something, or is this a bug? def findrevenueby_city(transactions: pd.DataFrame, users: pd.DataFrame, exchange_rate: pd.DataFrame) -> pd.DataFrame: gets user city for each user id userids = users[['id', 'usercity']] and merge on transactions transactions = transactions.merge(user_ids, how='left"See full answer
Data Scientist
Data Analysis
+2 more
Top Product Lines
IDE
Medium
Data Scientist
Data Analysis
+2 more
4 answers I was asked this
+1
"Schema is wrong - id from product is mapped to id from transactions, id from product should point to product_id in transcations table"
Arshad P. - "Schema is wrong - id from product is mapped to id from transactions, id from product should point to product_id in transcations table"See full answer
Data Scientist
Data Analysis
+2 more
Walk me through your process for cleaning a messy dataset.
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
Have you ever had to work with poor-quality data or suggest new tracking?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
Asked at Snap • 4 years ago
How would you use data to help Snap engineering improve phone camera speed?
Data Analysis
Analytical
1 answer I was asked this
Data Analysis
Analytical
How do you handle joining data from different sources with inconsistent IDs?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
What questions would you ask before starting an analysis?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
How would you optimize fulfillment time for grocery deliveries in high-traffic zones?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
How would you evaluate whether a new feature is worth rolling out to all users?
Data Analyst
Data Analysis
+2 more
Add answer I was asked this
Data Analyst
Data Analysis
+2 more
Improving Students
IDE
Hard
Data Analyst
Data Analysis
+2 more
3 answers I was asked this
" import pandas as pd def findimprovingstudents(transcript: pd.DataFrame) -> pd.DataFrame: summary = transcript.pivottable(index='studentid', values = 'yearlygpa', aggfunc = 'sum',columns = 'year').resetindex() summary['average_gpa'] = round((summary[2023] + summary[2022] + summary[2021])/3,2) return summary(summary[2023] > summary[2022]) & (summary[2022] > summary[2021])] #yn > yn-1, yn-1 > yn-2, yn-3 debug your co"
Caleb S. - " import pandas as pd def findimprovingstudents(transcript: pd.DataFrame) -> pd.DataFrame: summary = transcript.pivottable(index='studentid', values = 'yearlygpa', aggfunc = 'sum',columns = 'year').resetindex() summary['average_gpa'] = round((summary[2023] + summary[2022] + summary[2021])/3,2) return summary(summary[2023] > summary[2022]) & (summary[2022] > summary[2021])] #yn > yn-1, yn-1 > yn-2, yn-3 debug your co"See full answer
Data Analyst
Data Analysis
+2 more
Asked at Meta (Facebook) • 4 years ago
Given an array of integers, print all subarrays that sum to zero.
Data Analysis
Data Structures & Algorithms
+1 more
2 answers I was asked this
"sum of continuous subarray and keep checking if arr[i]==arr[j]. if true increase count;"
Rishabh R. - "sum of continuous subarray and keep checking if arr[i]==arr[j]. if true increase count;"See full answer
Data Analysis
Data Structures & Algorithms
+1 more
A newly launched mobile app has a 25% drop-off during sign-up. How would you investigate the issue?
Data Scientist
Data Analysis
+1 more
Add answer I was asked this
Data Scientist
Data Analysis
+1 more