The celery docs don't have too much information about the optimal number of celery workers to use per machine. I believe by default celery creates one worker per machine core.
I know from experimentation that on a single core machine, starting more celery workers is definitely beneficial (default is 1 worker because of 1 core). I'm looking for the threshold where adding more workers has marginally diminishing returns -- the optimal worker number per core. I am currently using a celery daemon with the daemon config file celeryd being having this line:
CELERYD_NODES="worker1 worker2 worker3"
My intention is to create 3 workers PER CORE (so if I started a 4 core machine, there would be 12 workers total). Am I doing this correctly, or will this only start 3 workers regardless of number of cores?