The task is to provide real time analytics
How many VMs added/deleted in a given range
Most used Product
Count of each operation
Questions:
How will process the file?
How will you design db for analytics? what tech sql-nosql
How will you scale for 100TB file per min
What tools/technologies will be used?