What file systems can be used for checkpointing

Question

The documentation says that any Hadoop API compatible file system (such as HDFS or S3) can be used as the checkpoint directory.

Apart from HDFS and S3, what other practical alternatives are there for a Spark Streaming application that uses Kafka and Cassandra?

Thanks

score 0 · Answer 1 · answered Jan 07 '16 at 01:34


You can use any distributed file system, such as GlusterFS, GFS, Lustre and many more, provided that Spark can access it through the Hadoop-compatible file system API.
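As a rough illustration, here is a minimal PySpark sketch of pointing the checkpoint directory at different file systems. The host names, bucket name, and mount point are placeholders, `require_scheme` and `create_context` are hypothetical helper names, and the `s3a://` and `file://` entries assume that the matching Hadoop connector is on the classpath and that the shared volume is mounted at the same path on every node.

```python
from urllib.parse import urlparse

# Hypothetical checkpoint locations; hosts, bucket, and mount point are placeholders.
CHECKPOINT_DIRS = {
    "hdfs": "hdfs://namenode:8020/spark/checkpoints",
    "s3a": "s3a://my-bucket/spark/checkpoints",
    # Assumes a GlusterFS or Lustre volume mounted at the same path on every node.
    "file": "file:///mnt/shared/spark/checkpoints",
}


def require_scheme(uri):
    """Return the URI scheme, rejecting paths that omit one."""
    scheme = urlparse(uri).scheme
    if not scheme:
        raise ValueError(f"checkpoint path has no scheme: {uri!r}")
    return scheme


def create_context(checkpoint_dir, batch_seconds=10):
    """Build a StreamingContext that checkpoints to checkpoint_dir."""
    # Imported here so the helpers above work without PySpark installed.
    from pyspark import SparkContext
    from pyspark.streaming import StreamingContext

    require_scheme(checkpoint_dir)
    sc = SparkContext(appName="CheckpointDemo")
    ssc = StreamingContext(sc, batch_seconds)
    ssc.checkpoint(checkpoint_dir)  # directory must be visible to driver and executors
    return ssc
```

Calling `create_context(CHECKPOINT_DIRS["file"])` would checkpoint to the shared mount. The explicit scheme check is there because a path without a scheme is resolved against the cluster's default file system, which may not be the one you intended.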


Sumit

