Doesn't glibc's pthread mutex implementation use futexes already? My quick look at nptl/pthread_mutex_lock.c suggests that it does, at least for some cases. Ideally, that is a better place for that than Wine apart from maybe some specific places, like, e. g., implementing Windows sync primitives. There are a lot of things to consider for general implementation to be used as drop in replacement. And, as a secondary note, a synthetic test measuring sync primitive lock and unlock time in a tight loop is probably not all that convincing, proper justification of the universal benefits is more complicated than that.