google search engine architecture- how do so many concurrent users do a search on it

Question

With millions of users searching for so many things on google, yahoo and so on. How can the server handle so many concurrent searches? I have no clue as to how they made it so scalable. Any insight into their architecture would be welcomed.

miku · Accepted Answer · 2019-11-11T09:38:12.407

7

One element, DNS load balancing.

There are plenty of resources on google architecture, this site has a nice list:

http://highscalability.com/google-architecture

edited Nov 11 '19 at 09:38

answered May 29 '10 at 17:04

miku

181,842
47
306
310

@GezaKerecsenyi, thanks - removed. It seems google currently uses a DNS load balancing, that moves more control to Google, in that they only give out a single IP (as opposed to multiple ones, as a few years ago) - that single google IP is request IP dependent - so they can more predictable shift loads around. – miku Nov 11 '19 at 09:41

score 4 · Answer 2 · answered May 29 '10 at 17:46

DNS Load Balancing is correct, but it is not really the full answer to the question. Google uses a multitude of techniques, including but not limited to the following:

DNS Load Balancing (suggested)
Clustering - as suggested, but note the following
- clustered databases (the database storage and retrieval is spread over many machines)
- clustered web services (analogous to DNSLB here)
- An internally developed clustered/distributed filing system
Highly optimised search indices and algorithms, making storage efficient and retrieval fast across the cluster
Caching of requests (squid), responses (squid), databases (in memory, see shards in the above article)

score 3 · Answer 3 · answered May 29 '10 at 17:15

I've gone searching for information about this topic recently and Wikipedia's Google Platform article was the best all around source of information on how Google does it. However, the High Scalability blog has outstanding articles on scalability nearly every day. Be sure to check it out their Google architecture article too.

Ramesh Tabarna · Answer 4 · 2010-05-29T17:49:35.077

The primary concept in most of the highly scalable applications is clustering.

Some resources regarding the cluster architecture of different search engines.

You can also read interesting research articles at Google Research and Yahoo Research.

google search engine architecture- how do so many concurrent users do a search on it

4 Answers4