Data Engineer Interview Questions

Review this list of 105 data engineer interview questions and answers verified by hiring managers and candidates.

Asked at Walmart Labs • 8 months ago
Tell me about your e-commerce experience.
Data Engineer
Behavioral
+2 more
Asked at Discord • 8 months ago
Why did you become an engineer?
Data Engineer
Behavioral
+1 more
Asked at Adobe, Apple, Intuit + 3 more • 8 months ago
Sudoku Solver
IDE
Hard
Data Engineer
Data Structures & Algorithms
+4 more
Divya R. -

    static boolean sudokuSolve(char[][] board) {
        return sudokuSolve(board, 0, 0);
    }

    static boolean sudokuSolve(char[][] board, int r, int c) {
        if (c >= board[0].length) {
            r = r + 1;
            c = 0;
        }
        if (r >= board.length) return true;
        if (board[r][c] == '.') {
            for (int num = 1; num <= 9; num++) {
                board[r][c] = (char) ('0' + num);
                if (isValidPosition(board, r, c)) {
                    if (sudokuSolve(board, r, c + 1)) return true;
                }
                board[r][c] = '.';
            }
        } else {
            return sudokuSolve(board, r, c + 1);
        }
        return false;
    }

    static boolean isValidPosition(char b [...]
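The quoted answer is cut off before its validity check, so the following is only a minimal Python sketch of the same backtracking idea, not a reconstruction of the original. It assumes a 9x9 board of single-character strings with '.' for empty cells; the names `is_valid` and `solve` are illustrative.

```python
from typing import List

Board = List[List[str]]


def is_valid(board: Board, r: int, c: int) -> bool:
    """Check that the digit at (r, c) is unique in its row, column, and 3x3 box."""
    digit = board[r][c]
    for i in range(9):
        if i != c and board[r][i] == digit:
            return False
        if i != r and board[i][c] == digit:
            return False
    box_r, box_c = 3 * (r // 3), 3 * (c // 3)
    for i in range(box_r, box_r + 3):
        for j in range(box_c, box_c + 3):
            if (i, j) != (r, c) and board[i][j] == digit:
                return False
    return True


def solve(board: Board, r: int = 0, c: int = 0) -> bool:
    """Fill empty cells in place, row by row; return True once the board is complete."""
    if c == 9:
        r, c = r + 1, 0
    if r == 9:
        return True
    if board[r][c] != ".":
        return solve(board, r, c + 1)
    for digit in "123456789":
        board[r][c] = digit
        if is_valid(board, r, c) and solve(board, r, c + 1):
            return True
    board[r][c] = "."  # undo the guess before backtracking
    return False
```

Calling `solve(board)` mutates `board` in place and returns False when no assignment satisfies the constraints.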
Asked at Adobe, Amazon, Apple + 10 more • 8 months ago
Calculate the trapped rainwater between bars in a given array.
IDE
Hard
Data Engineer
Data Structures & Algorithms
+4 more
Anonymous Roadrunner -

    from typing import List

    def traprainwater(height: List[int]) -> int:
        if not height:
            return 0
        l, r = 0, len(height) - 1
        leftMax, rightMax = height[l], height[r]
        res = 0
        while l < r:
            if leftMax < rightMax:
                l += 1
                leftMax = max(leftMax, height[l])
                res += leftMax - height[l]
            else:
                r -= 1
                rightMax = max(rightMax, height[r])
                [...]
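Because the quoted answer is truncated, here is a complete sketch of the same two-pointer approach. The right-hand branch and the final return are an assumption that mirrors the visible left-hand branch, and the function name `trap_rain_water` is illustrative.

```python
from typing import List


def trap_rain_water(height: List[int]) -> int:
    """Two-pointer scan: the shorter side's running maximum bounds the water level."""
    if not height:
        return 0
    left, right = 0, len(height) - 1
    left_max, right_max = height[left], height[right]
    water = 0
    while left < right:
        if left_max < right_max:
            left += 1
            left_max = max(left_max, height[left])
            water += left_max - height[left]
        else:
            right -= 1
            right_max = max(right_max, height[right])
            water += right_max - height[right]
    return water
```

This runs in O(n) time and O(1) extra space, since each bar is visited once and only two running maxima are kept.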
Asked at Databricks • 9 months ago
How would you handle a task in a nightly job that fails unexpectedly during 10 percent of the runs?
Data Engineer
Data Pipeline Design
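No answer is recorded for this question. One ingredient that often comes up in answers is retrying transient failures with backoff while still surfacing the error if every attempt fails, so the root cause can be investigated. The sketch below assumes the task is idempotent (safe to re-run); `run_with_retries` and its parameters are hypothetical names, not part of any scheduler's API.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def run_with_retries(
    task: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run task, retrying failures with exponential backoff (hypothetical helper)."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    attempt = 1
    while True:
        try:
            return task()
        except Exception:
            if attempt == max_attempts:
                raise  # give up so the failure is visible to monitoring
            # Wait base_delay, then 2x, 4x, ... before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
            attempt += 1
```

Retries only mask the symptom; logging each failed attempt and tracking the failure rate would still be needed to find out why one run in ten fails.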
Asked at Discord • 8 months ago
Why do you want to work at Discord?
Data Engineer
Behavioral
+2 more
Asked at Adobe, Apple, Nvidia • 8 months ago
Build a Basic Regex Parser
IDE
Hard
Data Engineer
Data Structures & Algorithms
+3 more
Vince S. -

    func isMatch(text: String, pattern: String) -> Bool {
        // Convert strings to arrays for easier indexing
        let s = Array(text)
        let p = Array(pattern)

        // Create DP table: dp[i][j] represents if s[0...i-1] matches p[0...j-1]
        var dp = Array(repeating: Array(repeating: false, count: p.count + 1), count: s.count + 1)

        // Empty pattern matches empty string
        dp[0][ [...]
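The question does not say which operators the parser must support. The sketch below assumes only '.' (any single character) and '*' (zero or more of the preceding element) with whole-string matching, and uses the same table definition as the quoted answer. The name `is_match` is illustrative.

```python
def is_match(text: str, pattern: str) -> bool:
    """Whole-string match supporting '.' and '*' via dynamic programming."""
    n, m = len(text), len(pattern)
    # dp[i][j] is True when text[:i] matches pattern[:j].
    dp = [[False] * (m + 1) for _ in range(n + 1)]
    dp[0][0] = True  # empty pattern matches empty text

    # Patterns such as "a*b*" can match the empty text.
    for j in range(2, m + 1):
        if pattern[j - 1] == "*":
            dp[0][j] = dp[0][j - 2]

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if pattern[j - 1] == "*":
                if j < 2:
                    continue  # a leading '*' has nothing to repeat
                # Zero occurrences of the preceding element.
                dp[i][j] = dp[i][j - 2]
                # One more occurrence, if the preceding element matches text[i - 1].
                if pattern[j - 2] in (".", text[i - 1]):
                    dp[i][j] = dp[i][j] or dp[i - 1][j]
            elif pattern[j - 1] in (".", text[i - 1]):
                dp[i][j] = dp[i - 1][j - 1]
    return dp[n][m]
```

The table has (n + 1) x (m + 1) entries, so time and space are both O(n * m).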
Asked at Amazon • 3 years ago
What is the difference between OLTP and OLAP?
Data Engineer
Technical
+1 more
Nikunj V. - "OLTP (Online Transaction Processing) and OLAP (Online Analytical Processing) are two types of data processing systems, each designed for specific purposes in the context of database and data warehouse environments.
OLTP (Online Transaction Processing):
Purpose: OLTP systems are designed to manage and handle high volumes of transactions, such as inserting, updating, and deleting data. These systems are typically used in day-to-day business operations.
Characteristics: Handles small, si [...]"
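To make the contrast concrete, here is a small sketch using Python's built-in `sqlite3` module. SQLite is used only for illustration and is not an analytical engine; the `orders` table and its columns are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "id INTEGER PRIMARY KEY, customer TEXT, region TEXT, amount REAL)"
)

# OLTP-style work: many small transactions, each touching one row.
insert_sql = "INSERT INTO orders (customer, region, amount) VALUES (?, ?, ?)"
for row in [("alice", "west", 20.0), ("bob", "east", 35.0), ("carol", "west", 15.0)]:
    with conn:  # commits on success, rolls back on error
        conn.execute(insert_sql, row)
with conn:
    conn.execute("UPDATE orders SET amount = ? WHERE id = ?", (25.0, 1))

# OLAP-style work: a read-only aggregate that scans many rows at once.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
).fetchall()
print(totals)  # [('east', 35.0), ('west', 40.0)]
```

The write path cares about latency and atomicity per row, while the aggregate cares about scan throughput, which is why the two workloads are usually served by different systems.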
What types of indexes are in a relational database?
Data Engineer
Technical
Erjan G. - "I said there are hashed, clustered, and non-clustered indexes."
Asked at Adobe, Apple, Microsoft • 8 months ago
Determine if an array of integers from 1 to n contains a duplicate in constant time and space.
Data Engineer
Data Structures & Algorithms
+2 more
Kunal kumar S. - "Simply check its size: if the size is greater than n, then yes, it has a duplicate."
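The size check in the quoted answer is the pigeonhole principle: more than n values drawn from 1..n cannot all be distinct. A minimal sketch follows, assuming every value really lies in 1..n and that the length is available in constant time. The name `must_contain_duplicate` is illustrative, and a False result means only that a duplicate is not guaranteed by size alone.

```python
from typing import Sequence


def must_contain_duplicate(values: Sequence[int], n: int) -> bool:
    """Pigeonhole check: more than n values from the range 1..n must repeat."""
    return len(values) > n
```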
Asked at Salesforce • 9 months ago
Why do you want to work at Salesforce?
Data Engineer
Behavioral
+4 more
Asked at Databricks • 9 months ago
When should you use Delta Live Tables over standard data pipelines built on Spark and Delta Lake?
Data Engineer
Data Pipeline Design
Asked at Apple • 8 months ago
Sliding Window Maximum
Data Engineer
Coding
+1 more
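No answer is recorded for this question. A common approach is a monotonic deque of indices, sketched below under the assumption that the task is to return the maximum of every contiguous window of size k. The function name is illustrative.

```python
from collections import deque
from typing import Deque, List


def sliding_window_max(nums: List[int], k: int) -> List[int]:
    """Return the maximum of each length-k window using a monotonic deque."""
    if k <= 0 or k > len(nums):
        return []
    window: Deque[int] = deque()  # indices whose values are strictly decreasing
    result: List[int] = []
    for i, value in enumerate(nums):
        # Drop smaller-or-equal values; they can never be a window maximum again.
        while window and nums[window[-1]] <= value:
            window.pop()
        window.append(i)
        # Drop the front index once it slides out of the window.
        if window[0] <= i - k:
            window.popleft()
        if i >= k - 1:
            result.append(nums[window[0]])
    return result
```

Each index is pushed and popped at most once, so the scan is O(n) regardless of k.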
Asked at Adobe, Apple, Goldman Sachs + 2 more • 8 months ago
Find the longest palindromic subsequence using dynamic programming.
IDE
Medium
Data Engineer
Data Structures & Algorithms
+3 more
Tiago R. -

    function isPalindrome(s, start, end) {
        while (s[start] === s[end] && end >= start) {
            start++;
            end--;
        }
        return end <= start;
    }

    function longestPalindromicSubstring(s) {
        let longestPalindrome = '';
        for (let i = 0; i < s.length; i++) {
            let j = s.length - 1;
            while (s[i] !== s[j] && i <= j) {
                j--;
            }
            if (s[i] === s[j]) {
                if (isPalindrome(s, i, j)) {
                    const validPalindrome = s.substring(i, j + 1 [...]
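The quoted answer searches for a palindromic substring, while the question asks for a subsequence, where characters need not be adjacent. A minimal sketch of the standard interval DP for the subsequence version, returning only the length (the function name is illustrative):

```python
def longest_palindromic_subsequence(s: str) -> int:
    """Length of the longest palindromic subsequence of s, via interval DP."""
    n = len(s)
    if n == 0:
        return 0
    # dp[i][j] is the answer for the slice s[i..j]; entries with i > j stay 0.
    dp = [[0] * n for _ in range(n)]
    for i in range(n - 1, -1, -1):
        dp[i][i] = 1  # a single character is a palindrome
        for j in range(i + 1, n):
            if s[i] == s[j]:
                dp[i][j] = dp[i + 1][j - 1] + 2
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j - 1])
    return dp[0][n - 1]
```

Rows are filled from the bottom up so that `dp[i + 1][...]` is ready before row i needs it; time and space are O(n^2).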
Asked at Apple, Meta (Facebook), LinkedIn + 2 more • 3 months ago
Find the lowest common ancestor (LCA) of two nodes in a binary tree.
IDE
Medium
Data Engineer
Data Structures & Algorithms
+4 more
Vaibhav D. - "1. Make current the root. 2. While current is not null: if p and q are both less than current, go left; if p and q are both greater than current, go right; otherwise return current. 3. Return null."
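The quoted steps compare p and q against the current node, which relies on binary search tree ordering. For a general binary tree, as the question is worded, one common alternative is a recursive search, sketched here under the assumption that both nodes are present in the tree. The `Node` class and function name are illustrative.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(eq=False)  # compare nodes by identity, not by value
class Node:
    value: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def lowest_common_ancestor(
    root: Optional[Node], p: Node, q: Node
) -> Optional[Node]:
    """Return the deepest node that has both p and q in its subtree."""
    if root is None or root is p or root is q:
        return root
    left = lowest_common_ancestor(root.left, p, q)
    right = lowest_common_ancestor(root.right, p, q)
    if left is not None and right is not None:
        return root  # p and q were found on different sides
    return left if left is not None else right
```

This visits each node at most once, so it is O(n) time with recursion depth equal to the tree height.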
Explain the differences between Parquet and Avro.
Data Engineer
Technical
Erjan G. - "I did not know, but the answer is that Parquet is column-oriented and Avro is row-oriented."
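The row versus column distinction can be pictured with plain Python structures. This is a conceptual sketch only: it does not read or write either file format, and the sample records are invented.

```python
records = [
    {"id": 1, "city": "Austin", "amount": 10.0},
    {"id": 2, "city": "Boston", "amount": 20.0},
    {"id": 3, "city": "Austin", "amount": 30.0},
]

# Row-oriented layout (conceptually like Avro): each record's fields sit together.
row_layout = [tuple(record.values()) for record in records]

# Column-oriented layout (conceptually like Parquet): each field's values sit together.
column_layout = {key: [record[key] for record in records] for key in records[0]}

# Fetching one whole record is a single lookup in the row layout.
second_record = row_layout[1]

# Aggregating one field touches a single list in the column layout.
total_amount = sum(column_layout["amount"])
```

This is why row-oriented formats tend to suit record-at-a-time writes and streaming, while column-oriented formats tend to suit analytical scans over a few fields.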
Asked at Adobe, Apple, Microsoft • 8 months ago
Top k frequent elements
Data Engineer
Coding
+3 more
Chen J. - "LeetCode 347: heap + hash table. Follow-up question: create the heap with length K instead of N (more time complexity but less space)."
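A minimal sketch of the size-K heap variant mentioned in the follow-up, using a hash table of counts and a min-heap that never holds more than k entries. The function name is illustrative, and the order among equally frequent values is not specified by the question.

```python
import heapq
from collections import Counter
from typing import List, Tuple


def top_k_frequent(nums: List[int], k: int) -> List[int]:
    """Return the k most frequent values, most frequent first."""
    if k <= 0:
        return []
    counts = Counter(nums)  # hash table: value -> frequency
    heap: List[Tuple[int, int]] = []  # min-heap of (count, value), size <= k
    for value, count in counts.items():
        if len(heap) < k:
            heapq.heappush(heap, (count, value))
        elif count > heap[0][0]:
            # Evict the least frequent entry kept so far.
            heapq.heapreplace(heap, (count, value))
    return [value for _, value in sorted(heap, reverse=True)]
```

With d distinct values, the heap work is O(d log k) and the heap itself uses O(k) space on top of the O(d) counts table.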
Asked at Databricks • 9 months ago
What is Delta Lake?
Data Engineer
Data Pipeline Design
Asked at LinkedIn, Oracle, TikTok • 8 months ago
Serialize and deserialize binary tree
Data Engineer
Data Structures & Algorithms
+2 more
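No answer is recorded for this question. One common approach is a preorder traversal that writes an explicit marker for missing children, sketched below. The `TreeNode` class, the '#' marker, and the comma separator are all choices made for this example, and node values are assumed to be integers.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TreeNode:
    value: int
    left: Optional["TreeNode"] = None
    right: Optional["TreeNode"] = None


def serialize(root: Optional[TreeNode]) -> str:
    """Encode the tree in preorder, writing '#' for each missing child."""
    tokens: List[str] = []

    def visit(node: Optional[TreeNode]) -> None:
        if node is None:
            tokens.append("#")
            return
        tokens.append(str(node.value))
        visit(node.left)
        visit(node.right)

    visit(root)
    return ",".join(tokens)


def deserialize(data: str) -> Optional[TreeNode]:
    """Rebuild the tree by consuming tokens in the same preorder."""
    tokens = iter(data.split(","))

    def build() -> Optional[TreeNode]:
        token = next(tokens)
        if token == "#":
            return None
        node = TreeNode(int(token))
        node.left = build()
        node.right = build()
        return node

    return build()
```

Because every missing child is recorded, preorder alone is enough to rebuild the shape; the recursion depth equals the tree height, so very deep trees would need an iterative version.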
Explain the differences between wide and narrow dependencies in Apache Spark.
Data Engineer
Technical
Erjan G. - "I failed to answer; I did not know."