Reading from Wine NT devices is currently quite slow and heavy and I don't think we want to add that overhead to XInputGetState. Also it's a much larger change, and I'm not sure to see what advantage it has.
Fwiw XInputSetState is also supposed to be fast and asynchronous. It currently isn't and we instead avoid writing output reports if nothing changed, but that's probably incorrect and we should delegate the I/O to the thread instead.