Data validation between source and target table | PySpark Interview Question |

Hello Everyone,


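from pyspark.sql import SparkSession

# Reuse the active session (e.g. on Databricks) or create a local one
spark = SparkSession.builder.getOrCreate()

# Source table: ids 1 to 5
source_data = [(1, 'A'), (2, 'B'), (3, 'C'), (4, 'D'), (5, 'E')]
source_schema = ['id', 'name']
source_df = spark.createDataFrame(source_data, source_schema)
source_df.show()

# Target table: ids 3 and 4 carry different names, id 5 is absent, id 6 is extra
target_data = [(1, 'A'), (2, 'B'), (3, 'X'), (4, 'F'), (6, 'G')]
target_schema = ['id', 'name']
target_df = spark.createDataFrame(target_data, target_schema)
target_df.show()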


This series is for beginner and intermediate-level candidates who want to crack PySpark interviews.

Here is the link to the course: https://www.geekcoders.co.in/courses/...


#pyspark #interviewquestions #interview #pysparkinterview #dataengineer #aws #databricks #python
