AMD Metal drivers have a goofy bug where the bbox buffer stops being coherent with the CPU if you copy to it from a private (GPU-only) buffer and don't do anything else with it in that command buffer.
Turns out it was helpful (most of the improvement is in ubershaders). This time with a much better auto mode.
Not worth the extra code