An update on the drawStridedSlow issue: with my slightly less senseless ProcessVertices implementation, the framerate went up from 1.5 fps to 4.5 fps (CPU0 @ 600 MHz) and from 2 fps to 6 fps (CPU0 @ 1400 MHz). So I guess it's the garbage I'm sending to drawPrimitives that makes it so slow.