I would like to start learning and understanding what and when to optimize in a code meant for real time simulations (especially for games). There are various code snippets out there, most comparing standard trig or sqrt functions to their optimized versions. Some require assembly, others rely on architecture specific strategies and some utilize pure mathematical tricks (such as look up tables or approximation functions). Each version claims to be faster, at a minor precission cost.
To my knowledge, there's no compiled material (e.g. a book) explaining what can be done, including the basics (not anyone knows assembly, SSE, SIMD, MMX, FPU, etc.). There are, however, a plethora of books aimed at how to implement some numerical methods (such books are good, but they're not stressing the importance of fast tid-bits because the huge majority of those books do not even include architecture specific code or floating point discussions from a developer's point of view).
So, could anyone please share a very short list of books or other resources that best fit to these scenarios?
P.S.: I've the impression that "Numerical recipes in C++.." or "C++ for Scientists and Engineers.." or "C++ for Scientific Computing.." do not contain a lot (if any) information on this issue.