5

So everyone backs their stuff up to the cloud. But I want to back up my stuff from the cloud.

For a planned SaaS service, users are saving files containing their intellectual property and we're saving them in S3. I'm not entirely comfortable with the only copy of a file they are entrusting to me being at a third party. So I'd like to do a good encrypted, deduping, etc. backup from S3 to my on premise (or even from AWS straight to Iron Mountain or something). Anyone have an idea for this (besides "write scripts," yes, I could do that, but I'm happier to use software somewhat meant for the purpose).

The thing that seems to disqualify most normal backup software is that it would need to consume Amazon S3 as a source (not a destination - I know a lot of stuff does it as the destination). I'd want it to encrypt the files and then put them somewhere and ideally not transfer files that haven't changed. I'm actually not all that picky about other details. It can be Linux or Windows based, I can run it on an instance in the cloud or on a server on my premises, it can use whatever transit protocol.

I imagine if there's a good production-ready "mount s3 as a drive" option, then really many backup programs could be brought to bear including "ssh+rsync". I see a couple of these but they all look sketchy.

Is anyone doing this or know how it might be most easily accomplished? Open source is good but I'm also willing to toss some money at it if it gets me nice features.

Ernest Mueller
  • 1,199
  • 2
  • 12
  • 25
  • You're on the right track, multiple copies is the only way to be sure your data is safe. – Nate Oct 13 '10 at 21:40
  • @Nate - in this case, multiple copies isn't what the OP is concerned about. S3 by default, stores multiple copies of each object in multiple datacenters. What Ernst is concerned about is the "all-your-eggs-in-one-basket" (that basket being Amazon AWS) issue, which IMO is a very valid concern. – EEAA Oct 13 '10 at 21:45
  • 2
    @ErikA My point is that it is only as good as one copy since its all under Amazon S3. Even if they have "redundancy" its not an extra copy... – Nate Oct 13 '10 at 22:06

2 Answers2

2

The only thing I've seen that even comes close is s3cmd's sync command; it's bi-directional, which is what you want.

pjz
  • 10,595
  • 1
  • 32
  • 40
0

If you plan to do manual backups try http://www.s3fox.net/

It works well for us.

FrankJK
  • 306
  • 1
  • 3