Which Python method can a data scientist use to remove duplicates?
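The pandas DataFrame.drop_duplicates() method removes duplicate rows. By default it keeps the first occurrence of each duplicated row and returns a new DataFrame, leaving the original unchanged.
Syntax: dataframe.drop_duplicates(subset=None, keep='first', inplace=False, ignore_index=False)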
Remove duplicate rows from the DataFrame:
import pandas as pd

data = {
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False]
}

df = pd.DataFrame(data)
# The second 'Mary' row repeats an earlier row exactly, so it is dropped.
newdf = df.drop_duplicates()
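
The other parameters in the signature change which rows count as duplicates and which copy survives. The following is a minimal sketch that reuses the same data; the variable names (first_kept, last_by_name, no_repeats, renumbered) are illustrative only, and ignore_index assumes pandas 1.0 or later.

```python
import pandas as pd

df = pd.DataFrame({
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False],
})

# Default behaviour: compare all columns, keep the first occurrence.
first_kept = df.drop_duplicates()

# Compare rows on the 'name' column only and keep the last occurrence.
last_by_name = df.drop_duplicates(subset=['name'], keep='last')

# keep=False drops every row that has a duplicate, keeping none of them.
no_repeats = df.drop_duplicates(keep=False)

# ignore_index=True renumbers the surviving rows from 0.
renumbered = df.drop_duplicates(ignore_index=True)

print(first_kept.index.tolist())    # [0, 1, 2]
print(last_by_name.index.tolist())  # [0, 2, 3]
print(no_repeats.index.tolist())    # [0, 2]
print(renumbered.index.tolist())    # [0, 1, 2]
```

Passing inplace=True would modify df directly and return None, which is why the sketch assigns each result to a new variable instead.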