
Find Statistical Evidence for Conversion Rate


During your interview loop, you may receive coding questions related to statistics, data manipulation, machine learning, or software engineering.

In this video, we'll walk through a statistics-based coding question that you could receive in a technical screen. The question has several parts, each intended to assess your understanding of statistical concepts and your ability to apply them in code.

Practice this interview question in your preferred .ipynb environment.

Dataset

You have been provided with a dataset of user interactions with two columns: geo and convert.

  • The geo column gives the user's state as an abbreviation (e.g., TX for Texas, CA for California).
  • The convert column holds a boolean value (True or False) that denotes whether the user converted (took a desired action).
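To make the two-column layout concrete, here is a minimal sketch that parses rows of this shape using only the Python standard library. The CSV file format, the sample rows, and the helper name load_rows are assumptions for illustration; the real dataset may be stored differently and its values will differ.

```python
import csv
import io

# Made-up sample rows in an ASSUMED CSV layout; this is not the real dataset.
SAMPLE_CSV = """geo,convert
TX,True
CA,False
MI,True
NY,False
WA,False
"""


def load_rows(text):
    """Parse CSV text into (geo, converted) tuples.

    geo is normalized to an upper-case abbreviation and the textual
    True/False flag is turned into a real Python bool.
    """
    reader = csv.DictReader(io.StringIO(text))
    return [
        (row["geo"].strip().upper(), row["convert"].strip().lower() == "true")
        for row in reader
    ]


rows = load_rows(SAMPLE_CSV)
for geo, converted in rows:
    print(geo, converted)
```

Converting the flag to a bool up front means later steps can count conversions directly instead of comparing strings.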

You can download the dataset from the provided link.

Task

Calculate the conversion rate for the experiment group (users from Michigan (MI), Texas (TX), or Washington (WA)) and for the control group (users from all other states). Then assess whether the difference in conversion rates between the two groups is statistically significant.
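One possible way to approach the task is a two-sided two-proportion z-test with a pooled standard error. The sketch below uses that test as an assumption, not as the required solution. A chi-square test on the 2x2 table would be a common alternative. The counts are made up, and the function names and the 0.05 significance level are illustrative choices.

```python
import math

EXPERIMENT_STATES = {"MI", "TX", "WA"}


def conversion_summary(rows):
    """Return ((conversions, users) for experiment, (conversions, users) for control)."""
    exp_conv = exp_n = ctl_conv = ctl_n = 0
    for geo, converted in rows:
        if geo in EXPERIMENT_STATES:
            exp_n += 1
            exp_conv += int(converted)
        else:
            ctl_n += 1
            ctl_conv += int(converted)
    return (exp_conv, exp_n), (ctl_conv, ctl_n)


def two_proportion_z_test(x1, n1, x2, n2):
    """Two-sided z-test for the difference of two proportions (pooled SE)."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        raise ValueError("pooled rate is 0 or 1; the z statistic is undefined")
    z = (p1 - p2) / se
    # Two-sided p-value under the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Made-up data: 120/1000 conversions in experiment states, 100/1000 elsewhere.
rows = (
    [("TX", True)] * 120 + [("TX", False)] * 880
    + [("CA", True)] * 100 + [("CA", False)] * 900
)
(exp_conv, exp_n), (ctl_conv, ctl_n) = conversion_summary(rows)
z, p_value = two_proportion_z_test(exp_conv, exp_n, ctl_conv, ctl_n)

ALPHA = 0.05  # assumed significance level, a common convention
print(f"experiment rate: {exp_conv / exp_n:.3f}")
print(f"control rate:    {ctl_conv / ctl_n:.3f}")
print(f"z = {z:.3f}, p = {p_value:.3f}, significant: {p_value < ALPHA}")
```

The normal approximation behind this test is generally considered reasonable when each group has a fair number of both conversions and non-conversions. With very small counts, an exact test may be a better fit.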