Epam | 1st Round | Senior Data Engineer
Anonymous User
3003

It was 1.30hr discussion with a senior DE .Below topics we discussed:

  1. All the tools and tech stack I ever worked on
  2. What is diff b/w dag and lineage graph
  3. Architecture of spark
  4. Optimisations in spark
  5. Optimisations in bq , sql
  6. Indexes in sql
  7. Find all EMP whose dept and Sal are same
  8. Python lambda functions, why it is faster
  9. Python decorator
  10. Python : count all occurrences of words in a string (has map)
  11. Ci cd
  12. Working methodology
  13. Spark job, no of task , actions
  14. Narrow and wide transformations
  15. Shuffling
  16. SCD types
  17. CDC
  18. Fact dimension tables , star and snowflake schema
  19. Query plan in sql
  20. Spark query physical plan
  21. Joins , physical joins
  22. What is data source and data sink when you were using spark application
  23. Optimisations in airflow
  24. Airflow variable configuration
  25. Airflow task depency
  26. Airflow architecture
  27. Acid properties
  28. DWH and delta lake
  29. Bq denormalizations
  30. Normalisation vs denormalisation , which is better
  31. Coalesce repartition
  32. Lazy evaluation
  33. What are the ways to upload data in bq
  34. How you will upload a 10 gb csv file in bq
  35. How will you delete duplicates from tables in bq
  36. Bq data types
  37. Security features in gcp
  38. Ways to compare files data for data validation
  39. How are you good with dealing with clients
Comments (3)