First round of Data Engineering Interview | Product Based Companies | Ecom Express Limited | Walmart


This is a mock interview for a Data Engineering role. This first round consists of questions on Big Data, SQL, and basic data structures.

Here I have connected with Sanyam Jain, an excellent engineer who works as a Data Engineer at Ecom Express. I hope this mock interview helps anyone preparing for Data Engineering or Big Data interviews.

🔅 To book a Mock interview - https://topmate.io/ankur_ranjan/15155

𝗝𝗼𝗶𝗻 𝗺𝗲 𝗼𝗻 𝗦𝗼𝗰𝗶𝗮𝗹 𝗠𝗲𝗱𝗶𝗮:
🔅 Topmate - (Book 1:1 or other sessions)
https://topmate.io/ankur_ranjan
🔅 LinkedIn -   / thebigdatashow  
🔅 Instagram -   / ranjan_anku  

Sanyam Jain's LinkedIn profile
🔅   / sanyam-jain-270979155  

Chapters
00:00 - Introduction
01:35 - Project Explanation - In about 10 months, Sanyam developed an automated, real-time refreshed Data Lake ETL pipeline using Kafka, Debezium, Spark Structured Streaming, Apache Hudi, and Airflow (for scheduling scripts). The data is stored as Hive tables on AWS S3, using AWS Glue and EMR. Using this Data Lake as a source, his team built a real-time dashboard for real-time analytics.
10:14 - What is checkpointing in Apache Spark?
10:43 - Difference between checkpointing the data and persisting the data.
20:56 - Write an SQL query to report the customer ids from the Customer table that bought all the products in the Product table.
32:00 - Write an SQL query that reports the buyers who have bought an S8 but not an iPhone. Note that S8 and iPhone are products present in the Product table.
42:29 - Discussion on basic data structures.
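
For anyone who wants to try the two SQL questions (20:56 and 32:00) before watching, here is a minimal sketch of one possible approach, run through Python's built-in sqlite3 module. The schema is an assumption, since the description does not give one: a hypothetical Product(product_id, product_name) table and a Customer(customer_id, product_id) purchase table, with every purchase pointing at a valid product. The answers discussed in the video may differ.

```python
import sqlite3

# Hypothetical schema and sample data (assumed, not taken from the video).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Product (product_id INTEGER PRIMARY KEY, product_name TEXT);
    CREATE TABLE Customer (customer_id INTEGER, product_id INTEGER);
    INSERT INTO Product VALUES (1, 'S8'), (2, 'iPhone');
    -- customer 1 bought both products, customer 2 bought S8 twice,
    -- customer 3 bought only an iPhone
    INSERT INTO Customer VALUES (1, 1), (1, 2), (2, 1), (2, 1), (3, 2);
""")

# Q1: customers who bought every product.
# Compare each customer's distinct product count with the catalogue size.
# Assumes every Customer.product_id exists in Product.
ALL_PRODUCTS_SQL = """
    SELECT customer_id
    FROM Customer
    GROUP BY customer_id
    HAVING COUNT(DISTINCT product_id) = (SELECT COUNT(*) FROM Product)
    ORDER BY customer_id
"""

# Q2: buyers with at least one S8 purchase and no iPhone purchase.
# Conditional aggregation counts purchases of each product per buyer.
S8_NOT_IPHONE_SQL = """
    SELECT c.customer_id
    FROM Customer AS c
    JOIN Product AS p ON p.product_id = c.product_id
    GROUP BY c.customer_id
    HAVING SUM(CASE WHEN p.product_name = 'S8' THEN 1 ELSE 0 END) > 0
       AND SUM(CASE WHEN p.product_name = 'iPhone' THEN 1 ELSE 0 END) = 0
    ORDER BY c.customer_id
"""

bought_all = [row[0] for row in conn.execute(ALL_PRODUCTS_SQL)]
s8_not_iphone = [row[0] for row in conn.execute(S8_NOT_IPHONE_SQL)]

print("Bought all products:", bought_all)
print("Bought S8 but not iPhone:", s8_not_iphone)
```

COUNT(DISTINCT ...) matters in the first query because a customer may buy the same product more than once. The second query could also be written with NOT IN or NOT EXISTS; conditional aggregation is just one option that scans the purchase table once.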

#DataEngineering #interview #bigdata #apachespark #dataengineerjob #careerswitch #job #dataengineeringessentials #mockinterview
