0

How to use Sqoop incremental lastmodified if source table does not have timestamp column or any such date column and client is not allowing to make any changes in source table.Please Guide!

user8167344
  • 353
  • 2
  • 6
  • 17

1 Answers1

1

1. Your source table is never updated, just append

You fetch new inserted rows based on the primary autoincrement key by specifying the last row you integrated previously.

2. Your source table is both updated and inserted

In case your source table is also updated, the only way you have is to fetch the entire table and compare source & target with a hash function based on all columns. You can discover yourself the rows that have been modified by comparing all the columns using the hash() function on the new table and the hive table There is several way to update a hive table:

parisni
  • 920
  • 7
  • 20