[ ] Examine data types, missing value patterns, and distributions for every feature. - [ ] Understand the domain context: what do the features represent physically? - [ ] Identify the target variable and check its distribution (balanced vs. imbalanced for classification, skewed vs. symmetric for reg