Data Engineer Interview Questions

Review this list of 154 Data Engineer interview questions and answers verified by hiring managers and candidates.
  • Asked at Adobe

    Permutations

    IDE
    Medium

    Tiago R. - "function permute(nums) { if (nums.length <= 1) { return [nums]; } const prevPermutations = permute(nums.slice(0, nums.length-1)); const currentNum = nums[nums.length-1]; const permutations = new Set(); for (let prev of prevPermutations) { for (let i=0; i < prev.length; i++) { permutations.add([...prev.slice(0, i), currentNum, ...prev.slice(i)]); } permutations.add([...prev, currentNum]); } return [...permutations]"
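The excerpt above builds permutations by inserting the last element into every position of each permutation of the remaining elements. A minimal runnable sketch of that idea follows, assuming the input values are distinct. It collects results in a plain array, because a JavaScript `Set` compares arrays by reference and so would not remove duplicate permutations anyway.

```javascript
// Sketch of the insertion approach: permute all but the last element,
// then insert the last element at every position of each result.
// Assumption: the input values are distinct.
function permute(nums) {
  if (nums.length <= 1) {
    return [nums.slice()];
  }
  const last = nums[nums.length - 1];
  const result = [];
  for (const prev of permute(nums.slice(0, -1))) {
    // i === prev.length appends `last` at the end.
    for (let i = 0; i <= prev.length; i++) {
      result.push([...prev.slice(0, i), last, ...prev.slice(i)]);
    }
  }
  return result;
}

console.log(permute([1, 2, 3]).length); // 6
```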

    Data Engineer
    Data Structures & Algorithms
    +3 more
  • Asked at Walmart Labs

    Rajeev K. - "I’ve spent over 6 years building and scaling e-commerce products across EMEA and APAC. At Jumia, I led product initiatives on the checkout and payments side. For example, I launched gamified promotions on PDP and checkout that improved engagement and delivered a 2.3x uplift in conversion. I also introduced automated installment payments and order cancellation flows, which not only improved user trust but also reduced complaints by 30% and lowered operational costs. Before that, at Lazada, I work"

    Data Engineer
    Behavioral
    +2 more
  • Asked at Adobe

    Reno S. - "func isMatch(text: String, pattern: String) -> Bool { // Convert strings to arrays for easier indexing let s = Array(text.characters) let p = Array(pattern.characters) guard !s.isEmpty && !p.isEmpty else { return true } // Create DP table: dp[i][j] represents if s[0...i-1] matches p[0...j-1] var dp = Array(repeating: Array(repeating: false, count: p.count + 1), count: s.count + 1) // Empty pattern matches empty string dp[0]["
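The excerpt above is cut off before the table is filled in, and the question text is not shown. The sketch below illustrates the same table-based idea under a loudly stated assumption: that `.` matches any single character, `*` means zero or more of the preceding element, and the whole text must match. Note that the early return in the excerpt reports a match whenever either input is empty; this sketch lets the table handle empty inputs instead.

```javascript
// Assumed semantics (not confirmed by the source): '.' matches any single
// character, '*' means zero or more of the preceding element, full match.
function isMatch(text, pattern) {
  const m = text.length;
  const n = pattern.length;
  // dp[i][j] is true when text[0..i-1] matches pattern[0..j-1].
  const dp = Array.from({ length: m + 1 }, () => new Array(n + 1).fill(false));
  dp[0][0] = true; // empty pattern matches empty text
  // A prefix such as "a*b*" can also match the empty text.
  for (let j = 2; j <= n; j++) {
    if (pattern[j - 1] === "*") dp[0][j] = dp[0][j - 2];
  }
  for (let i = 1; i <= m; i++) {
    for (let j = 1; j <= n; j++) {
      const pc = pattern[j - 1];
      if (pc === "*") {
        if (j < 2) continue; // a leading '*' has nothing to repeat: no match
        const prev = pattern[j - 2];
        const zeroCopies = dp[i][j - 2];
        const oneMore = (prev === "." || prev === text[i - 1]) && dp[i - 1][j];
        dp[i][j] = zeroCopies || oneMore;
      } else {
        dp[i][j] = (pc === "." || pc === text[i - 1]) && dp[i - 1][j - 1];
      }
    }
  }
  return dp[m][n];
}

console.log(isMatch("aab", "c*a*b")); // true
console.log(isMatch("aa", "a"));      // false
```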

    Data Engineer
    Data Structures & Algorithms
    +3 more
  • +1

    Erjan G. - "A process can include many threads. Good for concurrent and parallel task execution."

    Data Engineer
    Concept
  • Asked at Discord
    Data Engineer
    Behavioral
    +1 more
  • Asked at Apple
    +5

    Vaibhav D. - "Set current to root. While current is not null: if p and q are both less than current, go left; if p and q are both greater than current, go right; otherwise return current. Return null."
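The steps in this answer read like a lowest-common-ancestor walk over a binary search tree. A minimal sketch follows, assuming `p` and `q` are plain values that both exist in the tree. The `node` helper and the `{ val, left, right }` shape are hypothetical and used only for illustration.

```javascript
// Hypothetical node shape for illustration: { val, left, right }.
function node(val, left = null, right = null) {
  return { val, left, right };
}

// Walk down from the root; the first node that separates p and q
// (or equals one of them) is returned.
function lowestCommonAncestor(root, p, q) {
  let current = root;
  while (current !== null) {
    if (p < current.val && q < current.val) {
      current = current.left;   // both targets are in the left subtree
    } else if (p > current.val && q > current.val) {
      current = current.right;  // both targets are in the right subtree
    } else {
      return current;           // paths diverge here, or current is p or q
    }
  }
  return null; // empty tree
}

const root = node(6, node(2, node(0), node(4, node(3), node(5))), node(8, node(7), node(9)));
console.log(lowestCommonAncestor(root, 2, 8).val); // 6
console.log(lowestCommonAncestor(root, 3, 5).val); // 4
```

Because each step discards one subtree, the walk takes time proportional to the tree height and uses no extra memory.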

    Data Engineer
    Data Structures & Algorithms
    +4 more
  • Data Engineer
    Data Pipeline Design
  • Asked at Amazon

    Nikunj V. - "OLTP (Online Transaction Processing) and OLAP (Online Analytical Processing) are two types of data processing systems, each designed for specific purposes in the context of database and data warehouse environments. OLTP (Online Transaction Processing): Purpose: OLTP systems are designed to manage and handle high volumes of transactions, such as inserting, updating, and deleting data. These systems are typically used in day-to-day business operations. Characteristics: Handles small, si"

    Data Engineer
    Technical
    +1 more
  • Kunal kumar S. - "Simply check its size; if the size is greater than n, then yes, it has a duplicate."
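The question behind this answer is not shown, so the following is only one possible reading: if the values can take at most `n` distinct values (for example, integers from 1 to n), then a collection with more than `n` entries must contain a duplicate. The sketch below applies that shortcut and otherwise falls back to comparing a `Set`'s size with the array length. The function name and the `n` parameter are hypothetical.

```javascript
// Assumption: the values come from at most `n` distinct possibilities,
// for example integers 1..n. `hasDuplicate` and `n` are illustrative names.
function hasDuplicate(values, n) {
  // Pigeonhole shortcut: more entries than possible distinct values.
  if (values.length > n) {
    return true;
  }
  // General check: a Set drops repeats, so a smaller size means a duplicate.
  return new Set(values).size < values.length;
}

console.log(hasDuplicate([1, 2, 3, 1], 3)); // true
console.log(hasDuplicate([1, 2, 3], 3));    // false
```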

    Data Engineer
    Data Structures & Algorithms
    +2 more
  • Asked at Salesforce
    Data Engineer
    Behavioral
    +4 more
  • Sonveer K. - "na"

    Data Engineer
    Data Modeling
  • Asked at Databricks

    Nitish C. - "Delta Lake is a metadata layer on top of cloud storage that gives a data lake transactional capabilities. It helps implement upsert/merge because it enforces a schema on the data assets stored in the cloud. It also offers other capabilities such as liquid clustering, time travel, schema evolution, and deletes."

    Data Engineer
    Data Pipeline Design
  • Asked at LinkedIn
    Data Engineer
    Data Modeling
  • Asked at Adobe
    +6

    Tiago R. - "function isPalindrome(s, start, end) { while (s[start] === s[end] && end >= start) { start++; end--; } return end <= start; } function longestPalindromicSubstring(s) { let longestPalindrome = ''; for (let i=0; i < s.length; i++) { let j = s.length-1; while (s[i] !== s[j] && i <= j) { j--; } if (s[i] === s[j]) { if (isPalindrome(s, i, j)) { const validPalindrome = s.substring(i, j+1"
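The excerpt above is cut off, so its full logic cannot be reconstructed here. As a point of comparison, the sketch below uses a different, commonly used technique, expand around center, which is not the approach in the excerpt. It tries every character and every gap between characters as a palindrome center and keeps the longest span found.

```javascript
// Expand-around-center sketch (a swapped-in technique, not the excerpt's method).
function longestPalindromicSubstring(s) {
  let bestStart = 0;
  let bestLength = 0;

  // Grow outward from a center while the two ends match.
  const expand = (left, right) => {
    while (left >= 0 && right < s.length && s[left] === s[right]) {
      left--;
      right++;
    }
    // The loop overshoots by one position on each side.
    const length = right - left - 1;
    if (length > bestLength) {
      bestLength = length;
      bestStart = left + 1;
    }
  };

  for (let i = 0; i < s.length; i++) {
    expand(i, i);     // odd-length palindromes centered on i
    expand(i, i + 1); // even-length palindromes centered between i and i + 1
  }
  return s.substring(bestStart, bestStart + bestLength);
}

console.log(longestPalindromicSubstring("cbbd")); // "bb"
```

This runs in quadratic time in the worst case and uses constant extra space. When several palindromes share the maximum length, it returns the first one found.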

    Data Engineer
    Data Structures & Algorithms
    +3 more
  • Data Engineer
    Data Modeling
  • Asked at Discord
    Data Engineer
    Behavioral
    +2 more
  • Erjan G. - "I said there are hashed, clustered, and non-clustered."

    Data Engineer
    Technical
  • Data Engineer
    Data Pipeline Design
  • Data Engineer
    Data Pipeline Design
Showing 121-140 of 154