Coinbase Data Scientist Coding Questions

51 practice questions for Coinbase Data Scientist interviews

Coinbase data scientist interviews test statistical reasoning, ML model design, SQL proficiency, A/B testing methodology, and Python-based algorithm implementation.

coding Medium Verified Question #1

1. Generate NFT


Category: String coding problem
You are designing an NFT generation engine. You are given a set of Traits, where each trait has a name and a list of possible...
Input: List
Output: Array
coding Medium Verified Question #2

2. Blockchain Mining


Category: Dynamic programming coding problem
You are building a block construction module for a blockchain node. The goal is to select a subset of pending transactions to include in...
Input: Graph (nodes and edges)
Output: Computed result
coding Medium Verified Question #3

3. Crypto Trading System Stream


Category: String coding problem
Design a crypto trading system that manages a stream of orders. The system should support various operations like placing, pausing,...
Input: Array of strings
Output: Computed result
coding Hard Verified Question #4

4. Design Iterators


Category: Array coding problem
For this problem, you will be designing a series of different iterator classes. This problem is split into multiple related parts that...
Input: Array of integers
Output: Computed result
coding Medium Verified Question #5

5. Food Delivery System


Category: Trie-based coding problem
For this problem, you will be designing a food delivery system. This problem is split into three related parts, evolving from basic data...
Input: List
Output: Computed result
coding Hard Verified Question #6

6. Transaction System


Category: Tree coding problem
For this problem, you will be designing a system to handle financial transactions and account balances. This problem is split into three related...
Input: List
Output: Integer
coding Hard Verified Question #7

7. OA [CodeSignal] Cloud File Storage System


Category: Graph coding problem
Your task is to implement a simple in-memory cloud storage system that maps objects (files) to their metadata (name, size, etc.). You...
Input: Graph (nodes and edges)
Output: Array
coding Hard Verified Question #8

8. OA [CodeSignal] Design Banking System


Category: Graph coding problem
Design a banking system that supports account management, transactions, and various financial operations.
Input: Graph (nodes and edges)
Output: Computed result
coding Hard Verified Question #9

9. Capital Gains Tax Calculator


Category: String coding problem
You are given a chronologically sorted list of stock transactions. Each transaction is a list of strings in the format `[<timestamp>, <type>,...
Input: Array of strings
Output: Computed result
coding Medium Verified Question #10

10. Service Log Aggregator


Category: Trie-based coding problem
A distributed system emits log entries from multiple services and worker threads. Each log entry is a colon-separated string in the format...
Input: Array
Output: Computed result
coding Hard Verified Question #11

11. OA [CodeSignal] Knowledge Base System


Category: Graph coding problem
Design and implement a personal knowledge base called KnowledgeBaseSystem that stores articles with CRUD operations. The system operates entirely...
Input: Graph (nodes and edges)
Output: Computed result
coding Medium Verified Question #12

12. OA [CodeSignal] Workspace Tracker


Category: Interval-based coding problem
Build a system to track desk workers at a shared office space. The system records when each worker enters and leaves and computes how long they have...
Input: String
Output: Array
coding Hard Verified Question #13

13. Transaction Query Engine


Category: String coding problem
Design a system to filter and paginate a list of transaction records. Each record is a list of strings in the format `[timestamp, id, userId,...
Input: Array of strings
Output: Computed result
coding Medium Verified Question #14

14. Exchange Rate Finder


Category: String coding problem
You are given a set of currency exchange relationships. Each relationship specifies a direct exchange rate between two currencies. Rates are...
Input: List
Output: Computed result
coding Hard Verified Question #15

15. Order Matching Engine


Category: String coding problem
You are managing a cryptocurrency order book. The book holds buy and sell orders placed by traders. - A buy order indicates the maximum price a...
Input: String
Output: Computed result
coding Hard Verified Question #16

16. Account Transfer System


Category: String coding problem
You are given a list of fund transfer instructions and a set of accounts with initial balances. Each transfer moves a fixed percentage of the...
Input: List
Output: Computed result
coding Hard Verified Question #17

17. Restaurant Delivery Network


Category: String coding problem
You are building a food discovery platform. Given a user's location, a list of restaurants with their coordinates, and a menu of items with prices,...
Input: List
Output: Computed result
coding Medium hash map #1

1. [OA] Hash Map — Count Distinct Products in Transactions

Coinbase often analyses transaction data to understand product adoption. You need to help find the number of distinct products a user has interacted with in a series of transaction records.
Problem statement: Given a list of transaction records where each record is represented as a tuple containing (user_id, product_id), write a function to return the number of distinct products that each user has interacted with.
- Method Signature: def count_distinct_products(transactions: List[Tuple[int, int]]) -> Dict[int, int]: Returns a dictionary with user_id as keys and the count of distinct product_ids as values.
Example 1:
Input: transactions = [(1, 101), (1, 102), (2, 101), (1, 101), (2, 103)]
Output: {1: 2, 2: 2}
Explanation: User 1 interacted with products 101 and 102, while User 2 interacted with products 101 and 103.
Constraints:
- 1 <= len(transactions) <= 10^5
- 1 <= user_id, product_id <= 10^4
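One possible approach, sketched here rather than given as an official solution, is to keep a set of product_ids per user so that repeated (user_id, product_id) pairs are counted once. The function name and signature come from the problem statement; the internal name products_by_user is only illustrative.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def count_distinct_products(transactions: List[Tuple[int, int]]) -> Dict[int, int]:
    # Map each user_id to the set of product_ids seen so far.
    # A set absorbs duplicates, so (1, 101) appearing twice counts once.
    products_by_user = defaultdict(set)
    for user_id, product_id in transactions:
        products_by_user[user_id].add(product_id)

    # Convert each set to its size to get the distinct-product count per user.
    return {user_id: len(products) for user_id, products in products_by_user.items()}


if __name__ == "__main__":
    sample = [(1, 101), (1, 102), (2, 101), (1, 101), (2, 103)]
    print(count_distinct_products(sample))
```

This runs in time linear in the number of records, which should be comfortable for the stated limit of 10^5 transactions.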
coding Medium sliding window #2

2. [OA] Sliding Window — Find the Maximum Subarray Sum for Transaction History

Coinbase analytics processes vast amounts of transaction history data. You need to help the data team find the maximum sum of any contiguous subarray of transaction amounts over a specified transaction window.
Problem statement: Write a function that returns the maximum sum over all contiguous subarrays of exactly window_size transaction amounts in the transaction history array.
- Method Signature: def max_subarray_sum(transactions: List[int], window_size: int) -> int: Returns the maximum sum of a contiguous subarray of the specified size.
Example 1:
Input: transactions = [1, -2, 3, 4, -1, 2, 1, -5, 4], window_size = 3
Output: 6
Explanation: The subarray [3, 4, -1] has the maximum sum of 6.
Constraints:
- 1 <= len(transactions) <= 10^5
- -10^4 <= transactions[i] <= 10^4
- 1 <= window_size <= len(transactions)
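A minimal sketch of a fixed-size sliding window for this problem, assuming the window must contain exactly window_size elements as in the example: sum the first window once, then slide it one position at a time by adding the entering amount and subtracting the leaving one. The signature follows the problem statement; the local names window_sum and best are illustrative.

```python
from typing import List


def max_subarray_sum(transactions: List[int], window_size: int) -> int:
    # Sum of the first window; the constraints guarantee
    # 1 <= window_size <= len(transactions), so this slice is never empty.
    window_sum = sum(transactions[:window_size])
    best = window_sum

    # Slide the window right: add the new element, drop the oldest one.
    for i in range(window_size, len(transactions)):
        window_sum += transactions[i] - transactions[i - window_size]
        best = max(best, window_sum)

    return best


if __name__ == "__main__":
    print(max_subarray_sum([1, -2, 3, 4, -1, 2, 1, -5, 4], 3))
```

Each element is added and removed at most once, so the work is linear in len(transactions) rather than proportional to len(transactions) times window_size.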
