Amazon Data Engineering | Phone Screen
Anonymous User
2009

SQL:

The following tables are given:

Shipments - Shipment_id, Price, Order_id, Ship_date, Delivery_Location, Ship_method, ShipETA
Customer - Customer_id, Order_id, Customer_address, Prime_eligible
Order - Order_id, Order_Qty, Order_date, Order_location, Item_id, Shipment_id
Item - Item_id, Item_description, Item_Location

Question 1: List the customer ids who ordered more in 2019 than they did in 2018.
Question 2: List the customers who placed orders on two consecutive days in July 2020.
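
One possible approach to both questions, sketched with Python's built-in sqlite3 so it can be run locally. This is not what the interviewer asked for verbatim; it rests on a few assumptions: Customer and Order are joined on Order_id, "ordered more" means a higher number of orders (swap the counts for SUM(Order_Qty) if quantity is meant), Order_date is stored as ISO text (YYYY-MM-DD), and only the columns the queries need are created. The sample rows are made up for illustration, and date(..., '+1 day') is SQLite syntax; on Redshift you would use DATEADD or a LAG window instead.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Trimmed-down versions of the tables: only the columns the queries touch.
conn.executescript("""
CREATE TABLE Customer (Customer_id TEXT, Order_id INTEGER);
CREATE TABLE "Order" (Order_id INTEGER, Order_date TEXT);
""")

# Made-up sample data: (customer_id, order_id, order_date)
rows = [
    ("C1", 1, "2018-03-01"), ("C1", 2, "2019-04-01"), ("C1", 3, "2019-05-01"),
    ("C2", 4, "2018-01-01"), ("C2", 5, "2018-02-01"), ("C2", 6, "2019-01-01"),
    ("C3", 7, "2020-07-10"), ("C3", 8, "2020-07-11"),
    ("C1", 9, "2020-07-01"), ("C1", 10, "2020-07-03"),
    ("C2", 11, "2020-06-30"), ("C2", 12, "2020-07-01"),
]
conn.executemany("INSERT INTO Customer VALUES (?, ?)",
                 [(c, o) for c, o, _ in rows])
conn.executemany('INSERT INTO "Order" VALUES (?, ?)',
                 [(o, d) for _, o, d in rows])

# Question 1: conditional counts per customer, compared in HAVING.
Q1 = """
SELECT c.Customer_id
FROM Customer c
JOIN "Order" o ON o.Order_id = c.Order_id
GROUP BY c.Customer_id
HAVING SUM(CASE WHEN o.Order_date BETWEEN '2019-01-01' AND '2019-12-31'
                THEN 1 ELSE 0 END)
     > SUM(CASE WHEN o.Order_date BETWEEN '2018-01-01' AND '2018-12-31'
                THEN 1 ELSE 0 END)
ORDER BY c.Customer_id
"""

# Question 2: self-join each customer's July 2020 orders and keep pairs
# whose dates are exactly one day apart.
Q2 = """
SELECT DISTINCT c1.Customer_id
FROM Customer c1
JOIN "Order" o1 ON o1.Order_id = c1.Order_id
JOIN Customer c2 ON c2.Customer_id = c1.Customer_id
JOIN "Order" o2 ON o2.Order_id = c2.Order_id
WHERE o1.Order_date BETWEEN '2020-07-01' AND '2020-07-31'
  AND o2.Order_date BETWEEN '2020-07-01' AND '2020-07-31'
  AND o2.Order_date = date(o1.Order_date, '+1 day')
ORDER BY c1.Customer_id
"""

q1_result = [r[0] for r in conn.execute(Q1)]
q2_result = [r[0] for r in conn.execute(Q2)]
print(q1_result)
print(q2_result)
```

With the sample rows above, C2's pair (June 30, July 1) is deliberately left out of the second result because one of the two days falls outside July.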

Python:

The following JSON is given:

{	
	"Employees" : [
	{
	"userId":"xxxx",
	"jobTitleName":"Developer",
	"employeeCode":"E1",
	"region":"WA",
	"salary":8000
	
	},
	{
	"userId":"yyyyy",
	"jobTitleName":"Developer",
	"employeeCode":"E1",
	"region":"WA",
	"salary":7000
	},
	{
	"userId":"zzzz",
	"jobTitleName":"Developer",
	"employeeCode":"E1",
	"region":"WA",
	"salary":4000
	}
	]
}

Find the max salary.
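
A minimal sketch of one way to do this with the standard json module. It assumes the JSON arrives as a string (the variable name payload is mine) and that every employee record has a numeric "salary" field.

```python
import json

# The JSON from the question, embedded as a string for the example.
payload = """
{
  "Employees": [
    {"userId": "xxxx", "jobTitleName": "Developer", "employeeCode": "E1",
     "region": "WA", "salary": 8000},
    {"userId": "yyyyy", "jobTitleName": "Developer", "employeeCode": "E1",
     "region": "WA", "salary": 7000},
    {"userId": "zzzz", "jobTitleName": "Developer", "employeeCode": "E1",
     "region": "WA", "salary": 4000}
  ]
}
"""

employees = json.loads(payload)["Employees"]

# Highest salary value across all records.
max_salary = max(emp["salary"] for emp in employees)

# The full record of the top earner, in case the follow-up asks "who?".
top_earner = max(employees, key=lambda emp: emp["salary"])

print(max_salary)
print(top_earner["userId"])
```

If the list could be empty, max() raises ValueError; passing default=None to the first call is one way to handle that.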

Data Engineering

Below are the steps performed by your job:
Load file
Filter the records
Sum
Group by

How many stages will the DAG produce?
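
A rough way to reason about this, assuming the question is about Spark (the post does not say so explicitly): narrow steps such as load and filter are pipelined into one stage, and each step that needs a shuffle starts a new one. The toy model below only encodes that rule; it is not Spark's scheduler, and the step names and the WIDE_STEPS set are my own labels. Sum and group by are treated as a single grouped aggregation.

```python
# Steps assumed to require a shuffle (a "wide" dependency in Spark terms).
WIDE_STEPS = {"group_by"}


def count_stages(steps):
    """One initial stage, plus one more for every shuffle boundary."""
    return 1 + sum(1 for step in steps if step in WIDE_STEPS)


# The job from the question; the sum is applied within the grouping.
job = ["load_file", "filter", "group_by", "sum"]
stage_count = count_stages(job)
print(stage_count)
```

Under these assumptions the answer is 2 stages: load plus filter (with the partial aggregation) before the shuffle, and the final aggregation after it. Real plans can differ, for example when the data is already partitioned on the grouping key.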

How do you optimize a query/table in Redshift?
