How does Google store and organize documents, such as documents in Google Docs? I'd like to ask which file system Google uses, but I know it uses GFS, a distributed file system for storing huge files, basically the huge databases containing, among other things, also the Documents I am interested in.
My question is: Is each document a record in a DB? And how does it identify documents in a hierarchical system, such as web pages? How does it relate them, or represent the hierarchical structure, if needed?
It looks like Google created his own "file system" in a DB (besides the underlying GFS). Does anyone know any specification or its working?