Inter procedural optimization
These patches enable inter procedural optimization for the GCC and
Intel compilers. They may also work for the LLVM/clang.
Further response to #104.
Matthieu, I tried these and didn't see any improvements.
Can you also check please.
See merge request !101