Applied Intuition Interview Questions (2026-2027)
17 questions · 8 experiences · InterviewDB (17) · 1p3a (6) · LeetCode (2)
Browse by role
Top topics
25 entries
Big Data Design: Architect a Scalable Pipeline for Petabyte-Scale Log Processing
Campsite Booking: Find Available Campsites Given Reservation Intervals
Applied Intuition SWE Phone - Encode String (Strings/Encoding)
Applied Intuition SWE Phone - Formula Evaluation (Stack/Parsing)
Group Coordinates: Cluster 2D Points by Proximity
Job Monitor: Track Long-Running Jobs and Alert on Failures or Timeouts
Applied Intuition SWE Onsite - Key Value Store (Hash Table/Design)
Lane Segment: Compute the Segment a Point Belongs to on a Road Lane
Applied Intuition SWE Phone - Matrix Layer Rotation (Matrix/Simulation)
Message Parser: Parse Structured Protocol Messages into Field-Value Pairs
Mine Game: Determine Winning Strategy in a Two-Player Minefield Traversal Game
Points Query: SQL to Find All Points Within a Given Radius of a Location
Points Search: Find the K Nearest Points to an Origin Using a Priority Queue
UART Handler: Implement a Software UART Receiver That Decodes Serial Frames
Applied Intuition SWE Phone - Vehicle Collision (Arrays/Simulation)
Vehicle Velocity: Compute Instantaneous and Average Velocity from a GPS Trace
Vertices Compression: Reduce a 3D Mesh's Vertex List by Merging Near-Duplicate Vertices
Applied Intuition Fulltime Software Engineer Tech Phone Screen Interview Experience
Applied Intuition Software Engineer HR Screen Interview Outcome
Applied Intuition HR Screening Experience for Software Engineer Role
Applied Intuition HR Phone Screen and Online Assessment Experience
Applied Intuition Fulltime SDE Tech Phone Screen Interview
Applied Intuition RS Spring Video Interview Experience for Machine Learning Fulltime Role
#317 Shortest Distance from All Buildings
#609 Find Duplicate File in System
Big Data Design: Architect a Scalable Pipeline for Petabyte-Scale Log Processing
Question Details
Problem
You are asked to design a system that ingests 10 TB of application logs per day, processes them for anomaly detection and aggregation, and serves dashboards with sub-second query latency. Walk through the architecture.
Requirements:
- Ingest: 10 TB/day, 100k events/sec peak
- Processing: real-time aggregation + batch anomaly detection
- Query: dashboard queries over last 7 days, p99 < 500ms
- Retention: 90 days hot, 3 years cold
- Fault tolerance: no data loss
Proposed architecture:
Producers -> Kafka (partitioned by service) -+-> Flink (real-time agg)
| -> Redis (hot counters)
+-> S3 (raw parquet, partitioned by date)
-> Spark (batch anomaly, daily)
-> Trino/Presto (ad-hoc SQL)
Dashboard -> Druid (pre-agg OLAP)
Follow-ups
- Why partition Kafka by service rather than by timestamp? What are the tradeoffs?
- How do you handle late-arriving events in the Flink streaming layer?
- What compaction and partitioning strategy on S3 enables fast Trino queries?
- How would you implement exactly-once semantics end-to-end from Kafka to the database?
Topics
More from Applied Intuition
People also viewed