<?xml version="1.0" encoding="UTF-8"?><oembed><type>video</type><version>1.0</version><html>&lt;iframe src=&quot;https://www.loom.com/embed/8ce9f339554c456f8f6fd7777605750a&quot; frameborder=&quot;0&quot; width=&quot;1920&quot; height=&quot;1440&quot; webkitallowfullscreen mozallowfullscreen allowfullscreen&gt;&lt;/iframe&gt;</html><height>1440</height><width>1920</width><provider_name>Loom</provider_name><provider_url>https://www.loom.com</provider_url><thumbnail_height>1440</thumbnail_height><thumbnail_width>1920</thumbnail_width><thumbnail_url>https://cdn.loom.com/sessions/thumbnails/8ce9f339554c456f8f6fd7777605750a-f712e0e8792663ef.gif</thumbnail_url><duration>160.838</duration><title>Preparing Our Dataset for Analysis</title><description>In this video, I walk you through the steps to prepare our dataset for analysis, focusing on setting categorical variables and handling missing values. I highlight that approximately 20 to 30% of our categorical variables have missing values, which we will impute with a placeholder. For continuous variables, we&apos;ll use -999 for missing values. I also discuss the importance of data splitting before feature engineering to avoid leakage, and I provide the specific ratios for our train, test, and validation sets. Please make sure to follow these steps as we move forward.</description></oembed>