This details how to replicate data to a new cluster:
Can't a similar scheme be used to rapidly scale out a cluster with existing data? Say take a snapshot of all the nodes, copy them all to new nodes, set the tokens in the yaml, set the peers to point to the old instances, and then join them up?
Won't they be treated like nodes that once were part of the cluster and were rejoined?