Why does Dropbox use so many threads?

Question

My understanding of threads is that you can only have one thread per core, two with hyper threading, before you start losing efficiency.

This computer has eight cores, so it should work best with 8 or 16 threads, yet many applications use several times that, especially Dropbox.

[Screenshot: Dropbox process on Windows 7, 104 threads highlighted.]

It also uses 95 threads while idling on my laptop, which only has 4 cores.

Why is this the case? Does it have so many threads for programming convenience, have I misunderstood threading efficiency or is it something else entirely?

Waiting threads do not use any significant amount of resources; it's only when all 104 try to run together that things would get troublesome, but there may not be many cases where more than a few are active. — Ken Y-N, Apr 24 '17 at 08:29
Ah, I see. So the Processes tab in the Windows 7 Task Manager (and I believe Macs show a similar view) lists all the created threads, not necessarily the running ones. Thanks. — Nick Coughlin, Apr 24 '17 at 08:47
[Looking over in SuperUser](https://superuser.com/a/462970/347185), I just downloaded that tool and double-clicking on a process and selecting the Threads tab shows me for `lync.exe` it has 57 threads, yet at any point in time only four or five are active, each using under 0.01% of my CPU. I'm sure you'll see similar results for `DropBox`. — Ken Y-N, Apr 24 '17 at 09:10

score 2 · Answer 1 · answered Feb 26 '21 at 18:44

I took a peek at the Mac version of the client, and it seems to be written in Python and it uses several frameworks.

- A bunch of threads seem to be used by an in-house actor system
- Nucleus is used for app analytics
- There seems to be a P2P network
- Some networking threads (one per hyper-threaded, i.e. logical, core)
- A global pool (one per physical core)
- Many threads for file monitoring and thumbnail generation
- Task schedulers
- Logging
- Metrics
- DB checkpointing
- Something called "infinite configuration"
- etc.

Most are idle.

It looks like a hodgepodge of subsystems, each starting their own threads, but they don't seem too expensive in terms of memory or CPU.
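To illustrate why idle threads are cheap, here is a minimal Python sketch. It is not taken from the Dropbox client; the function names `spawn_idle_threads` and `cpu_seconds_while_idle` are made up for this example. It starts 100 threads that block on an event and measures how much CPU time the whole process consumes while they wait.

```python
import threading
import time


def spawn_idle_threads(count):
    """Start `count` threads that block on an Event until it is set."""
    release = threading.Event()
    threads = [
        threading.Thread(target=release.wait, daemon=True)
        for _ in range(count)
    ]
    for t in threads:
        t.start()
    return release, threads


def cpu_seconds_while_idle(count=100, wall_seconds=0.5):
    """Return (threads alive, process CPU seconds used) over an idle period."""
    release, threads = spawn_idle_threads(count)
    alive = sum(t.is_alive() for t in threads)

    cpu_before = time.process_time()  # CPU time of the whole process
    time.sleep(wall_seconds)          # the main thread waits as well
    cpu_used = time.process_time() - cpu_before

    release.set()                     # wake every thread so it can exit
    for t in threads:
        t.join()
    return alive, cpu_used


if __name__ == "__main__":
    alive, cpu_used = cpu_seconds_while_idle()
    print(f"{alive} threads alive, {cpu_used:.4f}s of CPU used while waiting")
```

On a typical machine the CPU time should be a small fraction of the wall-clock wait, because a blocked thread is not scheduled at all. Each thread still reserves some memory for its stack.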

David Schwartz · Answer 2 · 2021-02-26T19:39:33.017

> My understanding of threads is that you can only have one thread per core, two with hyper threading, before you start losing efficiency.

Nope, this is not true. I'm not sure why you think that, but it's not true.

As just the most obvious way to show that it's false, suppose you had that number of threads and one of them accessed a page of memory that wasn't in RAM and had to be loaded from disk. If you don't have any other threads that can run, then one core is wasted for the entire time it takes to read that page of memory from disk.
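The effect can be sketched roughly in Python. This is only an illustration under stated assumptions: `time.sleep` stands in for a thread that is blocked on a page fault or disk read, and the names `blocking_task` and `run_with_threads` are made up for the example. The same set of blocking tasks is run with a small pool and with a larger one.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def blocking_task(seconds):
    """Stand-in for work that blocks (page fault, disk or network read)."""
    time.sleep(seconds)
    return seconds


def run_with_threads(num_threads, num_tasks=16, block_seconds=0.05):
    """Run `num_tasks` blocking tasks on `num_threads` threads; return wall time."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=num_threads) as pool:
        # Consume the iterator so every task has finished before we stop timing.
        list(pool.map(blocking_task, [block_seconds] * num_tasks))
    return time.perf_counter() - start


if __name__ == "__main__":
    for n in (2, 16):
        print(f"{n:2d} threads: {run_with_threads(n):.3f}s")
```

With only two threads, each blocked task holds up the tasks queued behind it. With sixteen, the waits overlap, so the larger pool should finish sooner even on a machine with far fewer than sixteen cores.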

It's hard to address the misconception directly without knowing what flawed chain of reasoning led to it. But the most common one is that if you have more threads ready-to-run than you can execute at once, then you have lots of context switches and context switches are expensive.

But that is obviously wrong. If all the threads are ready-to-run, then no context switches are necessary. A context switch is only necessary if a running thread stops being ready-to-run.

If all context switches are voluntary, then the implementation can select the optimum number of context switches. And that's precisely what it does.

Having large numbers of threads causes you to lose efficiency if, and only if, lots of threads do a small amount of work and then become no longer ready-to-run while other waiting threads are ready-to-run. That forces the implementation to do a context switch even where it is not optimal.

Some applications that use lots of threads do in fact do this. And that does result in poor performance. But Dropbox doesn't.
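The costly pattern described above, where a thread does a tiny amount of work and then blocks so that another thread has to be switched in, can be sketched like this. It is a rough illustration, not a benchmark: the names `direct` and `ping_pong` are made up, and in CPython part of the measured overhead comes from the queue locks and the global interpreter lock rather than from context switches alone.

```python
import queue
import threading
import time


def direct(n):
    """Sum 0..n-1 in a single thread with no handoffs."""
    start = time.perf_counter()
    total = 0
    for i in range(n):
        total += i
    return total, time.perf_counter() - start


def ping_pong(n):
    """Compute the same sum, but hand every item to a second thread and back."""
    requests, replies = queue.Queue(), queue.Queue()

    def worker():
        while True:
            item = requests.get()   # block until the main thread sends work
            if item is None:        # sentinel: shut down
                return
            replies.put(item)       # tiny amount of work, then block again

    t = threading.Thread(target=worker)
    t.start()

    start = time.perf_counter()
    total = 0
    for i in range(n):
        requests.put(i)
        total += replies.get()      # block until the worker answers
    elapsed = time.perf_counter() - start

    requests.put(None)
    t.join()
    return total, elapsed


if __name__ == "__main__":
    n = 5000
    _, t_direct = direct(n)
    _, t_handoff = ping_pong(n)
    print(f"direct: {t_direct:.4f}s, ping-pong: {t_handoff:.4f}s")
```

Both functions produce the same sum, but every item in the ping-pong version requires each thread to block and be woken again, which is where the time goes.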

