Which Python method can a data scientist use to remove duplicates?
The pandas DataFrame method drop_duplicates() removes duplicate rows. By default it keeps the first occurrence of each row and returns a new DataFrame.
dataframe.drop_duplicates(subset=None, keep='first', inplace=False, ignore_index=False)
Remove duplicate rows from the DataFrame:
import pandas as pd

data = {
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False]
}

df = pd.DataFrame(data)
# The last row repeats the second row ('Mary', 40, False), so it is dropped
newdf = df.drop_duplicates()
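The other parameters in the syntax line change which rows count as duplicates and which copy is kept. Here is a minimal sketch using the same data as above; the variable names by_name, no_repeats and renumbered are only illustrative.

```python
import pandas as pd

df = pd.DataFrame({
    'name': ['Peter', 'Mary', 'John', 'Mary'],
    'age': [50, 40, 30, 40],
    'qualified': [True, False, False, False]
})

# subset: compare only the 'name' column; keep='last' keeps the final 'Mary' row
by_name = df.drop_duplicates(subset=['name'], keep='last')

# keep=False drops every row whose 'name' appears more than once
no_repeats = df.drop_duplicates(subset=['name'], keep=False)

# ignore_index=True relabels the remaining rows 0, 1, 2, ...
renumbered = df.drop_duplicates(subset=['name'], ignore_index=True)

print(by_name)
print(no_repeats)
print(renumbered)
```

Passing inplace=True would modify df itself and return None, so assigning the result to a new variable, as done here, is usually easier to follow.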