`exp2()` returns incorrect results when 24-bit precision is in effect (`_control87(_PC_24, _MCW_PC)`). It would probably be better to change the implementation so that it is not affected by the precision setting active when the function is called.
I guess there is a similar problem in other functions. I'm not sure what the best way of addressing it is (I guess it may be a side effect of using SSE).