Which Python method can be used to Remove duplicates by Data scientist?
The drop_duplicates() method removes duplicate rows.
dataframe.drop_duplicates(subset, keep, inplace, ignore_index)
Remove duplicate rows from the DataFrame:
1. import pandas as pd
2. data = {
3. 'name': ['Peter', 'Mary', 'John', 'Mary'],
4. 'age': [50, 40, 30, 40],
5. 'qualified': [True, False, False, False]
6. }
7.
8. df = pd.DataFrame(data)
9. newdf = df.drop_duplicates()
Pilar
3 months agoStarr
3 months agoRenea
3 months agoCora
4 months agoMeaghan
4 months agoTammara
4 months agoDevorah
4 months agoBeth
4 months agoAriel
5 months agoJanet
5 months agoKina
5 months agoIsreal
5 months agoTonette
5 months agoWalker
5 months agoCecil
5 months agoRozella
5 months agoEmelda
1 year agoBulah
1 year agoKirby
1 year agoFannie
1 year agoDolores
1 year agoVincent
1 year agoSolange
1 year agoBambi
1 year agoDenna
1 year agoMy
1 year agoGlory
2 years agoAudry
1 year agoDulce
1 year agoCandra
2 years agoSylvie
2 years agoPilar
1 year agoEdgar
1 year agoAmalia
1 year agoBarbra
2 years agoIvette
2 years agoLynelle
2 years agoKimberely
1 year agoDominic
1 year agoDominga
1 year agoHuey
2 years ago