2

I am comparing two sheets using Merge Rows (diff).

1st excel sheet:

enter image description here

2nd excel sheet:

enter image description here

and my pentaho transaction:

enter image description here

in preview data showing, that id 2.0 at 2nd row is add new row and at 4 row its showing same data is delete, its suppose to identical, so how it can be achieve.

Ravindra Gupta
  • 1,256
  • 12
  • 42

1 Answers1

4

Merge rows (diff) requires both input streams to be sorted by the merge keys (there's a warning about it when you edit the step's properties).

Put a sort rows step in each stream ahead of the Merge Rows (diff) step.

nsousa
  • 4,448
  • 1
  • 10
  • 15
  • Upvoted @nsouza answer. And don't be afraid of time performance or memory issues. The PDI is quite clever. – AlainD Jul 24 '18 at 09:02