Databricks - Handle Corrupt Records in PySpark (CSV & Json)

Описание к видео Databricks - Handle Corrupt Records in PySpark (CSV & Json)

In this video we see how to handle erroneous data when loading csv and json files. There are 3 modes, Permissive, DropMalformed and Failfast. We see all of them individually.

Follow me on social media:
LinkedIn: www.linkedin.com/in/apostolos-athanasiou-9a0baa119
GitHub: https://github.com/apostolos1927/

00:00 - Intro
01:00 - Handle Corrupt records

Комментарии

Информация по комментариям в разработке