My target is to use Microsoft face API cognitive service to detect the faces in a frame , then using the landmarks returned for each face I would track it using optical flow for example!.
My question is about the accuracy .. is this approach would work properly , or there are some other logical constrains exists behind tracking face using its landmarks?