Background: Senior Data Scientist with ~3 YOE
Phone Screen (60 min)
Format: Coding + Problem Solving
Problem Solving: Behavioral scenarios and use cases
Coding: Min Stack + follow-ups
Outcome: Passed to onsite
Onsite Loop (4 rounds, 60 min each)
Note: Recruiter's prep material was different from actual rounds for two rounds.
Round 1: ML Fundamentals + ML Coding
Actual Format: As described
ML Coding: Implement K-means from scratch
Follow-up: How would you vectorize this implementation?
(I struggled a bit with matrix broadcasting)
Round 2: ML Problem Solving + ML System Design
Actual Format: ML fundamentals + coding (no system design)
ML Questions (that I remember):
Coding: Find max number of points on a line (2D array of points)
I spent time handling floating point precision loss but got optimized solution
Round 3: Data Analysis + Applied Sciences
Actual Format: ML questions + coding
ML Questions:
Coding: Implement self-attention and masked self-attention
I got mask syntax slightly wrong but overall code was correct and optimal otherwise.
Round 4: Problem Solving + Coding (HackerRank)
Format: As described
Coding: Merge intervals
ML Fundamentals: Bias-variance tradeoff, bagging, boosting, calibration, drift
Behavioral: Standard behavioral questions (don't remember specifics)
Offer