I have two Pandas data frames, df1 and df2, which should have the same inferred Pandera schema. Unfortunately they do not: pa.infer_schema(df1) == pa.infer_schema(df2) returns False. The printed output (which should come from __repr__) of the two schemas looks identical on visual inspection, so I suspect the difference has something to do with their being different instances, but I am unsure about that.

How can I get a "diff" between two Pandera schemas to help me understand more quickly why they are not equal?

Galen

1 Answer

I found that calling pprint on the output of DeepDiff revealed that the minima and maxima of certain columns differed between the two data frames. Inferring these min/max constraints by default was not behavior I expected or wanted.
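
To illustrate the kind of diff that surfaces this, here is a minimal, standard-library-only sketch. It is not the DeepDiff call itself and not Pandera's API: diff_nested is a hypothetical helper, and schema_a and schema_b are hand-written dicts standing in for two inferred schemas whose columns and dtypes match but whose inferred value bounds differ.

```python
from pprint import pprint


def diff_nested(left, right, path=""):
    """Return {path: (left_value, right_value)} for every leaf that differs."""
    # Recurse only when both sides are dicts; anything else is compared as a leaf.
    if isinstance(left, dict) and isinstance(right, dict):
        changes = {}
        for key in sorted(set(left) | set(right), key=str):
            child_path = f"{path}.{key}" if path else str(key)
            if key not in left:
                changes[child_path] = ("<missing>", right[key])
            elif key not in right:
                changes[child_path] = (left[key], "<missing>")
            else:
                changes.update(diff_nested(left[key], right[key], child_path))
        return changes
    return {} if left == right else {path: (left, right)}


# Hypothetical stand-ins for two inferred schemas (not real Pandera output):
# same columns and dtypes, but bounds taken from each frame's own data.
schema_a = {
    "age": {"dtype": "int64", "checks": {"min": 18, "max": 65}},
    "name": {"dtype": "object", "checks": None},
}
schema_b = {
    "age": {"dtype": "int64", "checks": {"min": 21, "max": 70}},
    "name": {"dtype": "object", "checks": None},
}

changes = diff_nested(schema_a, schema_b)
pprint(changes)
```

Only the age bounds show up in the result, which is the same pattern the DeepDiff output showed for the real schemas: structurally identical schemas that differ only in data-dependent min/max values.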

Galen