What algorithm would be most useful for real time detecting deformable ROI (not object) in an video frame?

Question

I am working on a project in MATLAB (with later implementation in C++ or possibly Python, with OpenCV) where when a real time video starts, the user may draw a region of interest. It will be drawn out as polygon. However, this ROI may get out of the frame and then reappear. At this point, it should be recognized and detected. The main issue is that it is not a salient object with particular physical characteristics since it can change (to some degree) shape, bend, stretch. And also, it is supposed to be run in real time which kind of makes it hard to implement training.

I tried using color histograms to emphasize color features of the ROI, and then threshold the image but it works only in specific conditions. I do have engineering background, but have not used any Machine Learning before this. I read that HOG features with SVM may help, or even Hough transform. I got some good results using ACF, but it still requires a training phase for the classifier.

The goals of this is to re-initialize trackers within this region after they have been totally lost. I know that having a classifier trained, it will be possible to recognize it through a number of algorithms, but the problem I have is doing it in real time and without any previous training.

Since the video is surgical, I have uploaded 4 example frames of what the bounding box of user's ROI would be.

(Explicit photos of surgery below, hover above the region with the mouse or click to reveal them.)

Any suggestion would be of great help.

I know people who do this with CNNs, buts its very hard – Ander Biguri May 17 '19 at 11:01 — Ander Biguri, May 17 '19 at 11:01

score 1 · Answer 1 · answered May 17 '19 at 10:44

1

Because thi ROI is decided at the real time so that I dont think the function with pre-training would work here. Recently, i caught a topic that helped to find a sticker in the image. The suggested solution is template matching.

The steps are capturing the ROI and use template matching to search for "the best similar" region in every frame.

Because in your case, the object could be changing (not too much) over time, the result may not perfect but it is the good approach.

A Picture from mentioned topic

answered May 17 '19 at 10:44

Peter Lee

381
3
9

Having it run real time creates additional problems, especially without deep prior CS/AI knowledge. However, using template matching from OpenCV gives some usable results. Not ideal, but more than a good starting point. Thanks! – Martin V May 17 '19 at 20:00

What algorithm would be most useful for real time detecting deformable ROI (not object) in an video frame?

1 Answers1