I am looking for a triangular solver and have come across two: dtrsm in BLAS and dtrtrs in LAPACK. From the looks of it, both offer the same core functionality, with dtrsm offering a little more (scaling the right-hand side before solving the system).
I would like to know:
1) How else do these functions differ?
2) When performing the same operation, which is faster?
3) If the answer to (2) is not obvious, when is dtrsm recommended over dtrtrs, and vice versa?
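
To make the overlap concrete, here is a minimal pure-Python sketch of the semantics as I read them from the reference documentation. It is not the real BLAS/LAPACK code. The names `forward_substitute`, `trsm_like` and `trtrs_like` are hypothetical, and only the lower-triangular, non-transposed, non-unit-diagonal, left-side case is modelled. As far as I can tell, dtrsm solves A X = alpha B without testing for singularity, while dtrtrs solves A X = B and reports an exactly zero diagonal entry through its info return value.

```python
# Pure-Python model of the documented semantics (assumption: this mirrors
# only one of the many parameter combinations the real routines accept).

def forward_substitute(L, B):
    """Solve L X = B for lower-triangular L by forward substitution."""
    n = len(L)
    nrhs = len(B[0])
    X = [[0.0] * nrhs for _ in range(n)]
    for j in range(nrhs):          # one right-hand-side column at a time
        for i in range(n):
            s = B[i][j] - sum(L[i][k] * X[k][j] for k in range(i))
            X[i][j] = s / L[i][i]
    return X

def trsm_like(alpha, L, B):
    """Model of dtrsm: solve L X = alpha * B, with no singularity test."""
    scaled = [[alpha * b for b in row] for row in B]
    return forward_substitute(L, scaled)

def trtrs_like(L, B):
    """Model of dtrtrs: solve L X = B and return (X, info)."""
    for i in range(len(L)):
        if L[i][i] == 0.0:
            # info > 0: 1-based index of the zero diagonal entry, no solve done
            return None, i + 1
    return forward_substitute(L, B), 0

L = [[2.0, 0.0],
     [1.0, 4.0]]
B = [[2.0],
     [5.0]]

x_trsm = trsm_like(1.0, L, B)       # alpha = 1 gives the same system as dtrtrs
x_trtrs, info = trtrs_like(L, B)
```

With alpha = 1 the two models return the same solution, so the questions above are about what differs beyond the scaling factor and the info check.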