I am working on a basic image detection neural network in Tensorflow. It has been trained to identify foods with around 94% accuracy. I have wondered whether it is possible to supply text data with the image to the neural net to improve accuracy. For example, if a sugary sweet were to be identified, the text input could give information about the general age that eats that type of food (mainly children), any health effects that may be caused by it (diabetes), and other general information. Is this possible in TensorFlow? If so, what libraries are used? I have searched online and have found nothing.
Thank you