http://bugs.winehq.org/show_bug.cgi?id=11674
--- Comment #301 from Stefan Dösinger stefan@codeweavers.com 2013-05-10 05:33:13 CDT --- Fyi, I've done some performance testing with nvidia, r300g and r600g. On r300g and r600g, the glMapBufferRange codepath (i.e., the current one) is a lot faster than glBufferSubData in all tests except 3Dmark2001 (I tested 3dmark2k, 3dmark2k1, half life 2 and my own dynamic buffer test).
On Nvidia, glBufferSubData and glMapBufferRange are pretty equal when the threaded optimizations are off. With threaded optimizations on, glBufferSubData is a lot faster.