Clean the dataset:
1. Dealing with Missing Data
2. Dealing with Duplicates
3. Outlier Detection
4. Encoding Categorical Features
5. Transformation
Steps to implement:
- Open Jupyter Notebook.
- Create a dummy DataFrame using pandas.
- Apply the methods below.
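As a starting point, a dummy DataFrame could be declared roughly like this. The column names and values are made up for illustration; they are assumptions, not data from the original notebook. The frame deliberately contains gaps and a repeated row so that the later methods have something to act on.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: names, columns, and values are invented for this sketch.
df = pd.DataFrame({
    "name": ["Asha", "Ben", "Chen", "Ben", "Dana"],
    "age": [25, np.nan, 31, np.nan, 47],
    "city": ["Pune", "Leeds", None, "Leeds", "Oslo"],
    "salary": [50000.0, 62000.0, 58000.0, 62000.0, np.nan],
})

# Quick look at the frame and its column types.
print(df)
print(df.dtypes)
```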
1. Methods to check missing values
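A minimal sketch of how missing values can be checked in pandas, assuming a small made-up DataFrame (the column names are hypothetical):

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with a few gaps.
df = pd.DataFrame({
    "age": [25, np.nan, 31, np.nan],
    "city": ["Pune", "Leeds", None, "Oslo"],
})

missing_mask = df.isna()                 # True where a value is missing (isnull() is an alias)
missing_per_column = df.isna().sum()     # number of missing values in each column
missing_share = df.isna().mean()         # fraction of missing values in each column
has_any_missing = df.isna().any().any()  # True if the frame has at least one gap

print(missing_per_column)
```

`df.info()` is another quick check: it prints the non-null count for every column.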
2. Removing Missing Data
[Image: missing data]
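Removing missing data is usually done with `dropna`. The sketch below shows a few common variants on a made-up DataFrame; the column names are assumptions for illustration only.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: "notes" is entirely empty, the last row has no values at all.
df = pd.DataFrame({
    "age": [25, np.nan, 31, np.nan],
    "city": ["Pune", "Leeds", None, None],
    "notes": [np.nan, np.nan, np.nan, np.nan],
})

no_empty_cols = df.dropna(axis=1, how="all")  # drop columns where every value is missing
complete_rows = no_empty_cols.dropna()        # keep only rows with no missing values
age_known = df.dropna(subset=["age"])         # drop rows only when "age" is missing
not_all_empty = df.dropna(how="all")          # drop rows where every value is missing

print(complete_rows)
```

None of these calls modify `df` in place; each returns a new DataFrame.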
3. Filling Missing Data
[Image: filling missing values]
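Filling (imputing) missing data might look like the sketch below. The choice of median for numeric columns, mode for categorical columns, and linear interpolation for ordered readings is a common convention rather than a rule, and the column names are hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with one gap per column.
df = pd.DataFrame({
    "age": [25.0, np.nan, 31.0, 40.0],
    "city": ["Pune", "Leeds", None, "Leeds"],
    "temp": [20.0, np.nan, 24.0, 26.0],
})

filled = df.copy()

# Numeric column: fill with the median of the observed values.
filled["age"] = filled["age"].fillna(filled["age"].median())

# Categorical column: fill with the most frequent value.
filled["city"] = filled["city"].fillna(filled["city"].mode()[0])

# Ordered readings: estimate the gap from its neighbours (linear interpolation).
filled["temp"] = filled["temp"].interpolate()

print(filled)
```

For time-ordered data, `ffill()` and `bfill()` are alternatives that carry the previous or next observed value into the gap.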
4. Dealing with Duplicates
[Image: dealing with duplicates]
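A minimal sketch of finding and dropping duplicates, again on made-up data with hypothetical column names:

```python
import pandas as pd

# Hypothetical toy data: row 2 repeats row 1 exactly; row 4 repeats only the name.
df = pd.DataFrame({
    "name": ["Asha", "Ben", "Ben", "Chen", "Ben"],
    "city": ["Pune", "Leeds", "Leeds", "Oslo", "York"],
})

dup_mask = df.duplicated()        # True for rows that repeat an earlier row
n_duplicates = dup_mask.sum()     # number of fully duplicated rows

# Drop exact duplicates, keeping the first occurrence, and rebuild the index.
deduped = df.drop_duplicates().reset_index(drop=True)

# Treat rows as duplicates when only selected columns match.
unique_names = df.drop_duplicates(subset=["name"], keep="first")

print(deduped)
```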
5. Outlier Detection & Handling
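One widely used way to detect outliers is the interquartile range (IQR) rule, sketched below. This is an assumption about the method; z-scores or domain-specific limits are equally valid choices. The 1.5 multiplier is the conventional default, and the data is made up.

```python
import pandas as pd

# Hypothetical toy data with one obvious outlier.
df = pd.DataFrame({"value": [48.0, 50.0, 52.0, 49.0, 51.0, 50.0, 300.0]})

q1 = df["value"].quantile(0.25)
q3 = df["value"].quantile(0.75)
iqr = q3 - q1

# Anything beyond 1.5 * IQR from the quartiles is flagged as an outlier.
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr
is_outlier = (df["value"] < lower) | (df["value"] > upper)

outliers = df[is_outlier]    # inspect the flagged rows
removed = df[~is_outlier]    # option 1: drop the outliers
capped = df["value"].clip(lower=lower, upper=upper)  # option 2: cap them at the bounds

print(outliers)
```

Dropping loses rows, while capping keeps the row but limits its influence; which one fits depends on whether the extreme value is an error or a real observation.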
6. Encoding Categorical Features
[Image: encoding]
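Categorical features can be encoded with pandas alone, as in the sketch below. The columns and the ordering of the sizes are hypothetical; one-hot encoding suits unordered categories, while an explicit mapping suits ordered ones.

```python
import pandas as pd

# Hypothetical toy data: "city" has no natural order, "shirt_size" does.
df = pd.DataFrame({
    "city": ["Pune", "Leeds", "Oslo", "Leeds"],
    "shirt_size": ["small", "large", "medium", "small"],
})

# One-hot encoding: one 0/1 column per category.
one_hot = pd.get_dummies(df, columns=["city"], dtype=int)

# Label encoding: each category becomes an integer code (alphabetical order here).
city_codes = df["city"].astype("category").cat.codes

# Ordinal encoding: map categories to integers that respect their order.
size_order = {"small": 0, "medium": 1, "large": 2}
size_encoded = df["shirt_size"].map(size_order)

print(one_hot)
```

Label codes imply an order that unordered categories do not have, so one-hot encoding is usually the safer choice for models that treat numbers as magnitudes.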
7. Feature Transformation
[Image: feature transformation]
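Feature transformation covers several techniques; the sketch below shows three common ones (log transform, min-max scaling, and standardization) on a made-up `income` column. Which of these the original notebook used is not known, so treat the selection as an assumption.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data with a heavily skewed column.
df = pd.DataFrame({"income": [20000.0, 35000.0, 50000.0, 1000000.0]})

# Log transform: compresses large values; log1p computes log(1 + x), so zeros are safe.
df["income_log"] = np.log1p(df["income"])

# Min-max scaling: rescales values to the range [0, 1].
income_min, income_max = df["income"].min(), df["income"].max()
df["income_minmax"] = (df["income"] - income_min) / (income_max - income_min)

# Standardization (z-score): zero mean and unit standard deviation.
# pandas' std() uses the sample standard deviation (ddof=1) by default.
df["income_z"] = (df["income"] - df["income"].mean()) / df["income"].std()

print(df)
```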
8. Detecting and Handling Infinite Values
[Image: missing values]
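Infinite values often appear after a division by zero. A minimal sketch, assuming made-up `sales` and `visits` columns, detects them with `np.isinf` and converts them to `NaN` so the usual missing-data tools apply:

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: the zero in "visits" produces an infinite ratio.
df = pd.DataFrame({
    "sales": [100.0, 50.0, 80.0],
    "visits": [10.0, 0.0, 20.0],
})
df["rate"] = df["sales"] / df["visits"]   # 10.0, inf, 4.0

inf_mask = np.isinf(df["rate"])           # True where the value is +inf or -inf
n_infinite = inf_mask.sum()

# Replace infinities with NaN, then treat them like any other missing value.
cleaned = df["rate"].replace([np.inf, -np.inf], np.nan)
filled = cleaned.fillna(cleaned.median())  # one option; dropna() is another

print(filled)
```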
When to Remove vs. Fill Missing Data?
[Image: missing data handling cheat sheet]
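One way to turn the remove-or-fill decision into code is to look at the share of missing values per column. The sketch below uses a 50% cut-off, which is a hypothetical rule of thumb chosen for illustration and not a value taken from the cheat sheet; the right threshold depends on the dataset and on why the values are missing.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: "age" is mostly present, "comment" is mostly missing.
df = pd.DataFrame({
    "age": [25.0, np.nan, 31.0, 40.0, 38.0],
    "comment": [None, None, None, "ok", None],
})

DROP_THRESHOLD = 0.5  # assumed cut-off, not a fixed rule

missing_share = df.isna().mean()  # fraction of missing values per column

# Columns that are mostly empty carry little information: remove them.
cols_to_drop = missing_share[missing_share > DROP_THRESHOLD].index.tolist()

# Columns with only a few gaps are worth keeping: fill them.
cols_to_fill = missing_share[
    (missing_share > 0) & (missing_share <= DROP_THRESHOLD)
].index.tolist()

cleaned = df.drop(columns=cols_to_drop)
for col in cols_to_fill:
    if pd.api.types.is_numeric_dtype(cleaned[col]):
        cleaned[col] = cleaned[col].fillna(cleaned[col].median())
    else:
        cleaned[col] = cleaned[col].fillna(cleaned[col].mode()[0])

print(cleaned)
```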
Useful Resources:
- Python Playlist
- AI Career Path in 2025
- Machine Learning
- Learn Statistics
- Learn Data Visualization
- Data Analyst Interview Preparation Guide