Amazon Echo - the algorithm for ignoring ambiguous background commands in a voice recording being played:
This is a hypothetical question, but the situation could plausibly occur: if one uses an Amazon Echo to play a voice recording or a video that contains a sentence like "Alexa, could you ...", what will happen?
For example, if the video or recording contains the sentence "Alexa, could you stop the video?", what will happen?
Or suppose the video or recording contains the sentence "Alexa, please increase the volume to 8" while, at the same time, you tell the Echo "Alexa, please decrease the volume to 4". Could it tell which of the two commands it should carry out?
Would the Amazon Echo be able to ignore the voice recording or video being played, rather than mistaking it for a real command from a real person? What kind of algorithm does the Amazon Echo software use to handle this situation?