0

I'm trying to migrate data from Cassandra to ScyllaDB from snapshot using sstableloader and data in some tables gets loaded without any error but when verifying count using PySpark, it gives less rows in ScyllaDB than in Cassandra. Help needed!

Erick Ramirez
  • 13,964
  • 1
  • 18
  • 23
Varun Nagrare
  • 153
  • 1
  • 11

2 Answers2

1

I work at ScyllaDB

There are two tools that can be used to help find the differences:

Please post a bug on https://github.com/scylladb/scylla/issues if indeed there are missing rows.

Shlomi Livne
  • 467
  • 2
  • 3
  • Thanks for your reply. I solved this problem by using nodetool repair on source Cassandra keyspace and then loaded the snapshot in ScyllaDB using sstableloader. – Varun Nagrare Jan 08 '22 at 15:02
1

Solved this problem by using nodetool repair on Cassandra keyspace, took snapshot and loaded the snapshot in ScyllaDB using sstableloader.

Varun Nagrare
  • 153
  • 1
  • 11