This implementation has some issues, e.g. a generated GLSL shader will produce a compilation error when the immediate constant buffer contains an unsigned integer which represents NaN. Morever, from testing a similar implementation, it seems that some drivers don't handle shaders with huge arrays very well and such shaders perform poorly. My current impression is that we should implement immediate constant buffers using a uniform variable.
Also, a test would be nice.
On Mon, Jun 13, 2016 at 6:41 PM, Guillaume Charifi guillaume.charifi@sfr.fr wrote:
I think it would be better to store the number of vectors instead of the element count in "reg_maps->icb". Not that it matters much, but this block of code would certainly look nicer.
On 13 June 2016 at 20:32, Józef Kucia joseph.kucia@gmail.com wrote:
Which drivers are that? Using uniforms does have some advantages, but it also means the GLSL compiler potentially has less opportunities for optimisation. Perhaps that's ok if we can safely assume any such optimisations would have already been done by the HLSL compiler. There's also the consideration that accessing uniforms can potentially have higher latency than accessing immediate values, although I'm not aware of any hardware that supports arrays of immediate values, as opposed to single (vec4) values.
On Wed, Jun 15, 2016 at 1:35 PM, Henri Verbeet hverbeet@gmail.com wrote:
Switching from arrays to uniforms improved performance noticeably for Gauntlet on OS X and on Linux with Nvidia binary drivers.
Mesa seems to lower const arrays to uniforms anyway. However, we cannot declare these arrays as const when uintBitsToFloat is used (at least in GLSL 1.30), so we are at risk of spilling these arrays to scratch memory. An excerpt from a comment in the Mesa source code: "Lower constant arrays to uniform arrays. Some driver backends (such as i965 and nouveau) don't handle constant arrays gracefully, instead treating them as ordinary writable temporary arrays. Since arrays can be large, this often means spilling them to scratch memory, which usually involves a large number of instructions.".
On 15 June 2016 at 15:22, Józef Kucia joseph.kucia@gmail.com wrote:
Switching from arrays to uniforms improved performance noticeably for Gauntlet on OS X and on Linux with Nvidia binary drivers.
Ok, that's good to know.
You don't necessarily need uintBitsToFloat() when you declare the icb though, you can also do that conversion in shader_glsl_get_register_name() or shader_glsl_add_src_param().
On Wed, Jun 15, 2016 at 3:49 PM, Henri Verbeet hverbeet@gmail.com wrote:
Yeah, we could declare ICB as constant unsigned integer array. I guess we could spend more time to figure out if this approach works well but I am not sure if there is point in doing so. If Mesa lowers const arrays to uniforms we can use uniforms directly as well.
On 15 June 2016 at 15:57, Józef Kucia joseph.kucia@gmail.com wrote:
Oh sure, I think using uniforms makes sense.