Section outline

    •   Data cleaning, integration, transformation
    •   Handling missing data and noise
    •   Feature selection and dimensionality reduction

    Reading

    •   Han, Kamber & Pei (Ch. 2–3)

    •   Lab: Data preprocessing using Python (Pandas/Scikit-learn)
    •   Case Study: Retail dataset cleaning for sales forecasting