The bigger question is whether all these tests really have unusual usage patterns, or whether this will simply hide memory leaks or excessive memory usage that would also affect real apps. And if the tests really are unusual, shouldn't they instead be fixed to better reflect the usage patterns of real apps?