9 practice questions for Yelp Data Engineer interviews
Yelp data engineer interviews cover ETL pipelines, data modelling, SQL optimisation, distributed systems like Spark and Kafka, and warehouse architecture.
Category: String coding problem
Given a string of space-separated words and a dictionary of recognized phrases, split the string into tokens. Phrases in the dictionary represent...
Input: String
Output: Computed result
Coding, Hard, Verified Question #2
2. Conference Buddy Pairing
Category: String coding problem
You are organizing a conference and want to pair up attendees as buddies. Each attendee belongs to a department.
Input: Array of pairs
Output: Integer
Coding, Medium, Verified Question #3
3. Dependency Chain End
Category: Graph coding problem
You are given a list of task dependency pairs [a, b] meaning task a is blocked by task b (i.e., a must wait for b). Each task blocks at...
Input: Graph (nodes and edges)
Output: Computed result
Coding, Easy, Verified Question #4
4. Jaccard Word Similarity
Category: String coding problem
Given two document title strings doc1 and doc2, compute their word-level Jaccard similarity. Split each title into words by spaces and treat each...
Input: String
Output: Computed result
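A minimal sketch of word-level Jaccard similarity, i.e. the size of the intersection of the two word sets divided by the size of their union. The problem statement is truncated, so several choices here are assumptions: words are compared case-sensitively, duplicates are collapsed into a set, and two empty titles score 0.0. The function name jaccard_word_similarity is hypothetical.

```python
def jaccard_word_similarity(doc1: str, doc2: str) -> float:
    """Return |A & B| / |A | B| over the sets of space-separated words."""
    # Assumption: each title is treated as a set of unique, case-sensitive words.
    words1 = set(doc1.split())
    words2 = set(doc2.split())

    union = words1 | words2
    if not union:
        # Assumption: two empty titles have similarity 0.0 (the source does not say).
        return 0.0

    return len(words1 & words2) / len(union)


if __name__ == "__main__":
    # Shared words: {"data", "engineer"}; union has 4 words.
    print(jaccard_word_similarity("data engineer guide", "data engineer handbook"))
```

If the full problem asks for case-insensitive matching or a rounded result, lowercasing the inputs or wrapping the return value in round() would be the only changes needed.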
Coding, Medium, Verified Question #5
5. Overlap String Join
Category: String coding problem
Given two code snippet strings s1 and s2, merge them by finding the longest suffix of s1 that exactly matches a prefix of s2. Join the two...
Input: String
Output: Printed output
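One way to sketch the overlap join is to try candidate overlap lengths from longest to shortest and stop at the first suffix of s1 that equals a prefix of s2. The rest of the statement is truncated, so this assumes the overlapping part should appear only once in the result and that the strings are simply concatenated when there is no overlap. The function name overlap_join is hypothetical, and it returns the string rather than printing it.

```python
def overlap_join(s1: str, s2: str) -> str:
    """Merge s1 and s2, writing their longest suffix/prefix overlap only once."""
    max_overlap = min(len(s1), len(s2))

    # Try the longest possible overlap first so the first match is the longest one.
    for k in range(max_overlap, 0, -1):
        if s1.endswith(s2[:k]):
            return s1 + s2[k:]

    # Assumption: with no overlap, the strings are concatenated as-is.
    return s1 + s2


if __name__ == "__main__":
    print(overlap_join("hello wor", "world"))
```

This brute-force check is quadratic in the worst case. For long inputs, the prefix function from the Knuth-Morris-Pratt algorithm, computed over s2 followed by a separator and s1, finds the same overlap in linear time.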
Coding, Medium, Verified Question #6
6. Product Search Engine
Category: Algorithm coding problem
Design a ProductSearch class for a product catalog search engine. The constructor takes a list of product names. The search(keyword) method...
Input: List
Output: Computed result
Coding, Medium, Verified Question #7
7. Top K Active Customers
Category: String coding problem
You are given a list of support tickets where each ticket is [agent, customer, issue]. A customer's engagement score is the number of unique agents...
Input: List
Output: Computed result
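The engagement score described above can be sketched by collecting a set of agents per customer and ranking customers by the size of that set. The statement is truncated, so the tie-breaking rule (alphabetical by customer name) and the function name top_k_active_customers are assumptions, not part of the original problem.

```python
from collections import defaultdict


def top_k_active_customers(tickets, k):
    """Return the k customers who interacted with the most unique agents."""
    agents_by_customer = defaultdict(set)

    # A set per customer ensures each agent is counted once,
    # no matter how many tickets they share.
    for agent, customer, _issue in tickets:
        agents_by_customer[customer].add(agent)

    # Assumption: highest score first, ties broken alphabetically by customer.
    ranked = sorted(
        agents_by_customer,
        key=lambda customer: (-len(agents_by_customer[customer]), customer),
    )
    return ranked[:k]


if __name__ == "__main__":
    sample_tickets = [
        ["a1", "alice", "login"],
        ["a2", "alice", "billing"],
        ["a1", "bob", "refund"],
        ["a1", "alice", "login"],
        ["a3", "carol", "refund"],
        ["a2", "carol", "billing"],
    ]
    print(top_k_active_customers(sample_tickets, 2))
```

Sorting all customers costs O(n log n). When k is much smaller than the number of customers, heapq.nsmallest with the same key avoids sorting the full list.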
Coding, Medium, Verified Question #8
8. Top K Keyword Products
Category: Algorithm coding problem
You are given a list of product catalog item names and a search prefix. Return the top k item names where at least one word in the name starts with...
Input: List
Output: Array
Coding, Medium, Verified Question #9
9. Traffic Spike Counter
Category: Sliding window coding problem
You are given an array of hourly traffic measurements and a spike detection config. Count how many sliding windows of a given size trigger a spike...
Input: Array
Output: Computed result
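The sliding-window pattern behind this question can be illustrated with a running sum that is updated in constant time as the window moves. The actual spike condition is cut off in the statement, so this sketch assumes a hypothetical rule: a window triggers a spike when its total traffic is strictly greater than a threshold. The names count_spike_windows, window_size, and threshold are all illustrative.

```python
def count_spike_windows(traffic, window_size, threshold):
    """Count windows of window_size whose total traffic exceeds threshold."""
    if window_size <= 0 or window_size > len(traffic):
        return 0

    # Sum of the first window, computed once.
    window_sum = sum(traffic[:window_size])
    # Assumed spike rule: the window sum is strictly greater than threshold.
    count = 1 if window_sum > threshold else 0

    # Slide the window: add the entering hour, drop the leaving hour.
    for i in range(window_size, len(traffic)):
        window_sum += traffic[i] - traffic[i - window_size]
        if window_sum > threshold:
            count += 1

    return count


if __name__ == "__main__":
    # Window sums for size 2: 30, 50, 70, 45
    print(count_spike_windows([10, 20, 30, 40, 5], 2, 45))
```

If the real config defines a spike differently, for example by window average or by a ratio against the previous window, only the two comparison lines change and the running-sum update stays the same.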