Thursday, July 28, 2022

Data Cleansing

-----


https://pixabay.com/zh/illustrations/people-silhouettes-lots-collection-943873/

-----

I. The Five Steps


「5 Steps in Data Cleaning

1. Identify data that needs to be cleaned and remove 

2. Fix structural mistakes

3. Set data cleansing techniques

4. Filter outliers and fix missing data

5. Implement processes」[1]。

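The five steps of data cleaning, restated:

1. Identify the data that needs to be cleaned or removed.

2. Fix structural errors.

3. Set up the data cleansing techniques.

4. Filter outliers and repair missing data.

5. Implement the process.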
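
The five steps can be sketched as a small pandas routine. This is only a minimal illustration and is not taken from [1]: the sample data, the column names (`name`, `city`, `age`) and the `clean` function are hypothetical, and treating "set techniques" and "implement processes" as "put the rules into one reusable function" is my own reading of steps 3 and 5.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
raw = pd.DataFrame({
    "name": [" Alice", "bob ", "bob ", "Carol", None],
    "city": ["Taipei", "taipei", "taipei", "TAIPEI ", "Tainan"],
    "age": [29, 41, 41, None, 35],
})


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Steps 3 and 5 (assumed reading): keep the rules in one reusable function."""
    # Step 1: identify and remove unusable rows
    # (exact duplicates and rows without a name).
    out = df.drop_duplicates().dropna(subset=["name"]).copy()

    # Step 2: fix structural mistakes such as stray whitespace
    # and inconsistent capitalization.
    out["name"] = out["name"].str.strip().str.title()
    out["city"] = out["city"].str.strip().str.title()

    # Step 4 (missing-data half): fill missing ages with the median.
    # Median imputation is an assumption, not a rule from the source.
    out["age"] = out["age"].fillna(out["age"].median())

    return out.reset_index(drop=True)


cleaned = clean(raw)
print(cleaned)
```

Note that the duplicate check runs before the text is normalized, so only rows that are identical character for character are dropped. Whether to normalize first is a design choice that depends on the data.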

-----

II. Outliers and Missing Data

「Filter outliers and fix missing data」[1]。
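
As a rough sketch of what this step can look like, the example below uses only the Python standard library so the arithmetic stays visible. The source does not say how to detect outliers or how to repair gaps. The 1.5 × IQR rule and median imputation used here are common conventions that I am assuming, and `readings`, `filter_outliers` and `fill_missing` are hypothetical names.

```python
from statistics import median, quantiles

# Hypothetical readings; None marks a missing value.
readings = [12.0, 11.5, None, 12.5, 13.0, 98.0, 11.0, None, 12.25]


def filter_outliers(values, k=1.5):
    """Drop values outside [Q1 - k*IQR, Q3 + k*IQR]; keep None for later."""
    observed = [v for v in values if v is not None]
    q1, _, q3 = quantiles(observed, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v is None or low <= v <= high]


def fill_missing(values):
    """Replace None with the median of the observed values."""
    observed = [v for v in values if v is not None]
    mid = median(observed)
    return [mid if v is None else v for v in values]


# Filter first, so the outlier cannot distort the value used for filling.
filtered = filter_outliers(readings)
filled = fill_missing(filtered)
print(filled)
```

Here outliers are filtered before the gaps are filled, matching the order in the quoted step. Filling first would also shrink the interquartile range and could push ordinary values outside the bounds.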

-----

References


[1] What Is Data Cleaning and The Growing Importance Of Data Cleaning

https://www.expressanalytics.com/blog/growing-importance-of-data-cleaning/


[2] What is Data Cleansing (Data Cleaning, Data Scrubbing)?

https://www.techtarget.com/searchdatamanagement/definition/data-scrubbing

-----

Pandas (Table of Contents)

https://mandhistory.blogspot.com/2022/05/pandas.html

-----

Python Machine Learning (Table of Contents)

https://mandhistory.blogspot.com/2022/05/python.html
