I'm using PyTables to store some images as Array
and CArray
data types. For each of these images, I also want to store some basic metadata (e.g., EXIF data).
I can imagine a number of approaches to storing both of these data formats, ranging from storing the metadata with the AttributeSet
class for each Array
/CArray
to using a Table
for all of the metadata.
My question is: What is the best approach if I want to be able to efficiently query and extract images from the ultimate hdf5 file for processing? For example, I'd like to be able to extract images taken at certain times (12-3pm) and process that subset of the data and then insert copies into the database or replace the existing arrays.
Many thanks for the help.
Best,
Nick
[Edit (clarification): I'm currently processing these images as NumPy arrays, and I hope to preserve that functionality]