51 practice questions for Coinbase Data Scientist interviews
Coinbase data scientist interviews test statistical reasoning, ML model design, SQL proficiency, A/B testing methodology, and Python-based algorithm implementation.
Category: String coding problem# Question You are designing an NFT generation engine. You are given a set of Traits, where each trait has a name and a list of possible...Input: List Output: Array
codingMediumVerified Question#2
2. Blockchain Mining
Category: Dynamic programming coding problem# Question You are building a block construction module for a blockchain node. The goal is to select a subset of pending transactions to include in...Input: Graph (nodes and edges) Output: Computed result
codingMediumVerified Question#3
3. Crypto Trading System Stream
Category: String coding problem# Question Design a crypto trading system that manages a stream of orders. The system should support various operations like placing, pausing,...Input: Array of strings Output: Computed result
codingHardVerified Question#4
4. Design Iterators
Category: Array coding problem# Question For this problem, you will be designing a series of different iterator classes. This problem is split into multiple related parts that...Input: Array of integers Output: Computed result
codingMediumVerified Question#5
5. Food Delivery System
Category: Trie-based coding problem# Question For this problem, you will be designing a food delivery system. This problem is split into three related parts, evolving from basic data...Input: List Output: Computed result
codingHardVerified Question#6
6. Transaction System
Category: Tree coding problemFor this problem, you will be designing a system to handle financial transactions and account balances. This problem is split into three related...Input: List Output: Integer
codingHardVerified Question#7
7. OA[CodeSignal] Cloud File Storage System
Category: Graph coding problem# Question Your task is to implement a simple in-memory cloud storage system that maps objects (files) to their metadata (name, size, etc.). You...Input: Graph (nodes and edges) Output: Array
codingHardVerified Question#8
8. OA[CodeSignal] Design Banking System
Category: Graph coding problem# Question Design a banking system that supports account management, transactions, and various financial operations.Input: Graph (nodes and edges) Output: Computed result
codingHardVerified Question#9
9. Capital Gains Tax Calculator
Category: String coding problemYou are given a chronologically sorted list of stock transactions. Each transaction is a list of strings in the format `[<timestamp>, <type>,...Input: Array of strings Output: Computed result
codingMediumVerified Question#10
10. Service Log Aggregator
Category: Trie-based coding problemA distributed system emits log entries from multiple services and worker threads. Each log entry is a colon-separated string in the format...Input: Array Output: Computed result
codingHardVerified Question#11
11. OA [CodeSignal] Knowledge Base System
Category: Graph coding problemDesign and implement a personal knowledge base called KnowledgeBaseSystem that stores articles with CRUD operations. The system operates entirely...Input: Graph (nodes and edges) Output: Computed result
codingMediumVerified Question#12
12. OA [CodeSignal] Workspace Tracker
Category: Interval-based coding problemBuild a system to track desk workers at a shared office space. The system records when each worker enters and leaves and computes how long they have...Input: String Output: Array
codingHardVerified Question#13
13. Transaction Query Engine
Category: String coding problemDesign a system to filter and paginate a list of transaction records. Each record is a list of strings in the format `[timestamp, id, userId,...Input: Array of strings Output: Computed result
codingMediumVerified Question#14
14. Exchange Rate Finder
Category: String coding problemYou are given a set of currency exchange relationships. Each relationship specifies a direct exchange rate between two currencies. Rates are...Input: List Output: Computed result
codingHardVerified Question#15
15. Order Matching Engine
Category: String coding problemYou are managing a cryptocurrency order book. The book holds buy and sell orders placed by traders. - A buy order indicates the maximum price a...Input: String Output: Computed result
codingHardVerified Question#16
16. Account Transfer System
Category: String coding problemYou are given a list of fund transfer instructions and a set of accounts with initial balances. Each transfer moves a fixed percentage of the...Input: List Output: Computed result
codingHardVerified Question#17
17. Restaurant Delivery Network
Category: String coding problemYou are building a food discovery platform. Given a user's location, a list of restaurants with their coordinates, and a menu of items with prices,...Input: List Output: Computed result
codingMediumhash map#1
1. [OA] Hash Map — Count Distinct Products in Transactions
Coinbase often analyses transaction data to understand product adoption. You need to help find the number of distinct products a user has interacted with in a series of transaction records.Problem statement: Given a list of transaction records where each record is represented as a tuple containing (user_id, product_id), write a function to return the number of distinct products that each user has interacted with.- Method Signature: def count_distinct_products(transactions: List[Tuple[int, int]]) -> Dict[int, int]: Returns a dictionary with user_id as keys and the count of distinct product_ids as values.Example 1: Input: transactions = [(1, 101), (1, 102), (2, 101), (1, 101), (2, 103)] Output: {1: 2, 2: 2} Explanation: User 1 interacted with products 101 and 102, while User 2 interacted with products 101 and 103.Constraints: - 1 <= len(transactions) <= 10^5 - 1 <= user_id, product_id <= 10^4
codingMediumsliding window#2
2. [OA] Sliding Window — Find the Maximum Subarray Sum for Transaction History
Coinbase analytics processes vast amounts of transaction history data. You need to help the data team find the maximum sum of any contiguous subarray of transaction amounts over a specified transaction window.Problem statement: Write a function that returns the maximum sum of a contiguous subarray of n transaction amounts within the transaction history array. You need to capture the transaction amounts over a window_size.- Method Signature: def max_subarray_sum(transactions: List[int], window_size: int) -> int: Returns the maximum sum of a contiguous subarray of the specified size.Example 1: Input: transactions = [1, -2, 3, 4, -1, 2, 1, -5, 4], window_size = 3 Output: 6 Explanation: The subarray [3, 4, -1] has the maximum sum of 6.Constraints: - 1 <= len(transactions) <= 10^5 - -10^4 <= transactions[i] <= 10^4 - 1 <= window_size <= len(transactions)