Yeah. I don't know why we didn't do this right from the beginning. Or apply any of the other accumulated knowledge from the TestBot, for that matter.
We can certainly tweak things, but I see no evidence that the TestBot runs have consistently better results than the GitLab ones.
And honestly, if the tests can only succeed in a carefully tuned environment, that's not much better than when they only succeeded on my box. We need to accept that Wine runs in various environments with various upstream bugs, and deal with that. If upstream bugs have a high impact, we need to work around them in the code; if we decide that the impact is low enough not to bother, then it should be OK to ignore the failures in the tests.
There may be some unusual configurations that we decide not to support, and maybe fvwm focus is one of them, but that should be the last resort. We need to stop using upstream bugs as an excuse to not fix the tests.