0

I'm looking for the rationale about the method used by pandas profiling tool to identify duplicates rows (in a dataframe with multiple columns)? I couldn't find it in Pandas Profiling documentation.

TylerH
  • 20,799
  • 66
  • 75
  • 101

1 Answers1

0

See model/summary line 571-575.

In other cases, can be simplified as sum(df.duplicated())

loopy
  • 441
  • 3
  • 8