The requirement is that the python script be more illustrative than performant.
- Keep it simple (no multiprocessing, that can be a separate step)
- It should take-in images and corresponding labels in batches of 50.
- It should apply the
transformer
(resize, transpose, mean, raw-scale, channel swap) to each image in thesetup
step. - The rest of the network can be a caffe bvlc reference network or Alex net. It could be something simpler if it can better demonstrate that the network in working fine, end-to-end.
- In effect, the python layer should work as if the images and labels are being provided by
type: "ImageData"
(which takes in a text file with image path and label) - As a simplification, no validation code is required.
From the unit-test provided here
, my thought is that the setup
, forward
, and possibly reshape
code needs to be "filled" in (no backward
code needed). My assumption is that the forward
will process images in batches of 50 with a get_next_batch function.