I have a dataset of 9081 RGB images (256x256) showing 5 people performing three possible actions: moving forward/backward, waving hands, and bending forward.
For my application, I want to convert the different human bodies into a human pose template, to get something like this:
(web: https://reurl.cc/Y9LvRD)
In other words, I don't need features such as color and face, since they would only interfere with my neural network.
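To make the target more concrete, here is a minimal sketch of what I mean by a pose template. It assumes the 2D joint coordinates are already available from some pose estimator, which is the part I don't have yet. The 5-joint skeleton, the `BONES` list, and the function names are made up for illustration only.

```python
import numpy as np

# Hypothetical skeleton: 0=head, 1=torso, 2=hip, 3=left hand, 4=right hand.
# A real pose estimator would return more joints; this is only for illustration.
BONES = [(0, 1), (1, 2), (1, 3), (1, 4)]

def draw_line(canvas, p0, p1):
    """Rasterize a straight segment between two (x, y) points onto canvas."""
    n = int(max(abs(p1[0] - p0[0]), abs(p1[1] - p0[1]))) + 1
    xs = np.linspace(p0[0], p1[0], n).round().astype(int)
    ys = np.linspace(p0[1], p1[1], n).round().astype(int)
    h, w = canvas.shape
    keep = (xs >= 0) & (xs < w) & (ys >= 0) & (ys < h)  # drop out-of-frame pixels
    canvas[ys[keep], xs[keep]] = 1

def pose_template(keypoints, size=256):
    """Return a binary size x size stick figure with no color or face detail."""
    canvas = np.zeros((size, size), dtype=np.uint8)
    for a, b in BONES:
        draw_line(canvas, keypoints[a], keypoints[b])
    return canvas

# Made-up joint positions (x, y) for one image.
keypoints = [(128, 40), (128, 100), (128, 180), (80, 140), (176, 140)]
template = pose_template(keypoints)
```

The result is a single-channel image that keeps only the body configuration, which is the kind of input I would like to feed to the network instead of the raw RGB frames.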
May I have some suggestions? Any help is much appreciated :)