Given:
Some kind of data import from an external source. The data can be read in chunks of a defined size, for example 10 items at once. The items could be emails, for example.
Each chunk then has to pass through several steps that transform the data, filter items out, and so on.
There is no relation between the chunks or between the items within a chunk. The order of processing isn't important either.
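To make the setup concrete, here is a minimal sketch of what I mean, in plain Java without any Akka. All names (`ImportSketch`, `readChunks`, `normalize`, `dropInvalid`, `processChunk`) and the two example steps are made up for illustration; the real import has different steps.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical sketch of the import described above; not the real code.
class ImportSketch {

    // Splits the source into chunks of at most chunkSize items.
    static List<List<String>> readChunks(List<String> source, int chunkSize) {
        List<List<String>> chunks = new ArrayList<>();
        for (int i = 0; i < source.size(); i += chunkSize) {
            int end = Math.min(i + chunkSize, source.size());
            chunks.add(new ArrayList<>(source.subList(i, end)));
        }
        return chunks;
    }

    // Assumed step 1: a transformation (trim and lower-case every item).
    static List<String> normalize(List<String> chunk) {
        return chunk.stream()
                .map(s -> s.trim().toLowerCase())
                .collect(Collectors.toList());
    }

    // Assumed step 2: a filter (drop items that do not contain an '@').
    static List<String> dropInvalid(List<String> chunk) {
        return chunk.stream()
                .filter(s -> s.contains("@"))
                .collect(Collectors.toList());
    }

    // Runs one chunk through all steps. It only touches its own chunk,
    // so chunks can be processed independently and in any order.
    static List<String> processChunk(List<String> chunk) {
        return dropInvalid(normalize(chunk));
    }
}
```

Since `processChunk` depends on nothing but its own chunk, every chunk could in principle be processed at the same time.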
Question
I'm wondering which structure would be the right one if I do this with Akka, to get the best parallelization and performance.
1.) Should I create all actors as a chain of children, so that the ImportActor has a child which is the first step, the first step has the second step as its child, and so on?
Or should I rather have one ImportActor which owns all steps and calls them one after the other?
2.) One actor can only process one message at a time. To parallelize the import process, I'm thinking about using the PipeTo mechanism. Is this a good idea? Are there better options?
3.) Should I create one actor per chunk, like "Import_Chunk1_Actor", or should I push all messages to the single "ImportActor"?