I have an assignment about working with pictures that depth cameras take.
First thing first, this is how one of these pictures looks like.
The way the camera works is that it represents objects in different tones of color depending on how far they are from the camera lens. The closer something is the brighter the color gets. Farther = darker of course.
My assignment is that I basically need to make a people counter for a depth camera that is located in an elevator.
What my idea is that I would basically like to convert the image to a 2D matrix of numbers and then apply a local maximum finder algorithm in order to find those heads sticking out towards the camera so I can count the people in that way.
So, my questions are - Is this an okay approach for someone who is not very experienced? What tools should I use for the conversion? Can I do all of this in C or should I use something more advanced?
I haven't had much experience outside of C but I'm open to trying maybe C# if it provides more advanced and better tools for the job.
All advice appreciated!