If I remember right you reported that the game ran poorly on that Nvidia card - though I am not sure if you linked that to the driver's handling of ddraw4 buffers. If my memory is correct here I wager that it has more to do with the fact that the game creates a 1 MB buffer than how the driver handles the buffer.
I believe the opposite is correct actually. The standalone tests I ran for VB performance used a fixed size buffer of 800 vertices, and that also performed very badly with the NVidia card when using the same pattern as the game did (namely, no discard/nooverwrite, since ddraw4.) So even if the driver reported a lower dwMaxVertexCount value, the game would still perform badly.