On a linux box running postfix+amavis+spamassassin, we are thinking of implementing bayes filtering. This system already does spam filtering (without bayes) at the moment for multiple customer domains.
The question is, how should training be done in this scenario? Would we need to collect spam and ham from each client or would just one do and have a global database?
Thanks.