There was some discussion on whether this object cache is on such a hot path that it requires arch-specific optimizations (instead of just protecting the object cache with a mutex).
After a few tests with Cyberpunk 2077 (using Conor's branch `sm6_rebase`), it seems that yes, there is some performance loss if we just use a mutex instead of the lock-free implementation: specifically, the game calls `CreateConstantBufferView()` a lot of times and from many threads, causing a lot of mutex contention; also, while for other view types creation and destruction involve some Vulkan call, for CBVs there is only some little arithmetic and a few atomic operations, so it is conceivable that a contended mutex impacts performances significantly.