Live Coding: Cleaning/Debriding Wound Data

Proudly sponsored by Open Wound Research, https://www.openwoundresearch.com/

Join Zweli in this live coding session, "Cleaning/Debriding Wound Data." Watch as we transform a disorganized wound care dataset into a clean, structured format. We’ll walk you through loading Python libraries, cleaning columns and rows, fixing data types, and performing sanity checks. By the end, we’ll export the cleaned data to both Excel and Parquet files for seamless use in analysis and software applications. Perfect for those looking to master data cleaning in Python!

Theme music is « Real as it Gets (Dirty Deed) », by Justin G. Marcellus

Timestamps:
00:00 Intro and objective
02:18 The Pranic Healing DFU study and data
03:48 VS Code, importing libraries, and the CSV file
09:20 Getting a wide view of the data
14:00 Cleaning column name whitespace and case
25:24 Dropping columns
27:36 Renaming columns
41:30 Fixing column data types
1:29:07 Fixing wound area and A1C absolute and percent change
1:39:44 Some quick data exploration
1:45:14 Exporting Excel and Parquet files
1:46:10 Installing fastexcel in the terminal
1:48:58 Viewing the resulting Excel file
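
For the absolute and percent change step at 1:29:07, the arithmetic might look like the sketch below. The description does not say how the session defines these quantities, so this assumes change is measured as final minus baseline; the function names and sample numbers are hypothetical.

```python
from typing import Optional


def abs_change(baseline: float, final: float) -> float:
    """Absolute change, assumed here to be final minus baseline."""
    return final - baseline


def pct_change(baseline: float, final: float) -> Optional[float]:
    """Percent change relative to baseline; None when baseline is zero."""
    if baseline == 0:
        return None  # percent change is undefined for a zero baseline
    return (final - baseline) / baseline * 100.0


# Invented example values: wound area in cm^2 and A1C in percent.
area_abs = abs_change(4.0, 1.0)  # -3.0
area_pct = pct_change(4.0, 1.0)  # -75.0
a1c_abs = abs_change(8.0, 7.0)   # -1.0
a1c_pct = pct_change(8.0, 7.0)   # -12.5
```

Recomputing these values from the baseline and final measurements is one way to check derived columns that arrive in the raw file with inconsistent signs or units.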

Links from the episode:
The study data comes from the Pranic DFU study, which can be found here: https://www.ncbi.nlm.nih.gov/pmc/arti...
The data can be downloaded from here: https://figshare.com/articles/dataset...
The Polars user guide can be found here: https://docs.pola.rs/
Visual Studio Code IDE is free and can be downloaded from here: https://code.visualstudio.com/download
What is a Parquet file: https://parquet.apache.org/
The code and data can be found in OWR’s GitHub repo: https://github.com/OpenWoundResearch/...

Follow us:
LinkedIn:   / 90449662  
Facebook: https://www.facebook.com/profile.php?...
Instagram:   / openwoundresearch  

Thanks for watching!
