On Thu Oct 5 09:16:57 2023 +0000, Conor McCarthy wrote:
Mainly looking for comments on global vs. per-device. Windows lacks an equivalent of pthread's destructor callback for freeing memory, so we would need to track devices and free the cache memory when the last one is released.
I guess that `thread_local`, even if we agreed to jump to C11, would have the same problem?
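
For reference, a minimal sketch of the pthread behaviour in question, assuming a hypothetical `struct thread_cache` (the names here are illustrative, not the actual code in this MR). The destructor passed to `pthread_key_create()` runs at thread exit for every thread that stored a non-NULL value. A plain C11 `thread_local` pointer has no such hook, so the memory it points to would simply leak when the thread exits, and `TlsAlloc()` on Windows likewise takes no destructor argument.

```c
#include <assert.h>
#include <pthread.h>
#include <stdatomic.h>
#include <stdlib.h>

/* Hypothetical per-thread cache; a stand-in for illustration only. */
struct thread_cache
{
    void *entries[16];
};

static pthread_key_t cache_key;
static pthread_once_t cache_key_once = PTHREAD_ONCE_INIT;
static atomic_int caches_freed; /* Counts destructor runs, for demonstration. */

/* Called by pthreads at thread exit when the thread's value is non-NULL. */
static void thread_cache_destroy(void *data)
{
    free(data);
    atomic_fetch_add(&caches_freed, 1);
}

static void cache_key_init(void)
{
    int ret = pthread_key_create(&cache_key, thread_cache_destroy);

    assert(!ret);
    (void)ret;
}

/* Returns the calling thread's cache, allocating it on first use. */
static struct thread_cache *thread_cache_get(void)
{
    struct thread_cache *cache;

    pthread_once(&cache_key_once, cache_key_init);
    if ((cache = pthread_getspecific(cache_key)))
        return cache;
    if (!(cache = calloc(1, sizeof(*cache))))
        return NULL;
    if (pthread_setspecific(cache_key, cache))
    {
        free(cache);
        return NULL;
    }
    return cache;
}

static void *worker(void *arg)
{
    (void)arg;
    thread_cache_get();
    return NULL;
}

/* Starts up to 8 threads, joins them, and returns how many caches were freed. */
int run_workers(int count)
{
    pthread_t threads[8];
    int i, started = 0;

    if (count > 8)
        count = 8;
    for (i = 0; i < count; ++i)
    {
        if (pthread_create(&threads[started], NULL, worker, NULL))
            break;
        ++started;
    }
    for (i = 0; i < started; ++i)
        pthread_join(threads[i], NULL);

    return atomic_load(&caches_freed);
}
```

With a `thread_local struct thread_cache *cache;` in place of the key, nothing would call `free()` at thread exit, so we would still need the device tracking described above.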