The failure seems to be fairly consistent on Windows but more intermittent on Linux. I have also triggered it on my OS X machine, again only intermittently.

I can't tell whether it's a timing issue or some other test doing something bad. I'm worried that solving it will require taking the list of tests from one of the failing runs and slowly pruning it down to find the trigger (if it's not a timing thing).
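
If it does come down to pruning, a rough sketch of the idea might look like the following. Everything here is hypothetical and only for illustration: `find_trigger`, `fails`, `victim`, and the test names are made up, and `fails` stands in for whatever actually re-runs a subset of tests and reports whether the victim test failed. It assumes the failure reproduces deterministically for a given test order and that a single earlier test is the trigger, neither of which is confirmed here.

```python
from typing import Callable, List, Sequence


def find_trigger(
    candidates: Sequence[str],
    victim: str,
    fails: Callable[[List[str]], bool],
) -> List[str]:
    """Shrink `candidates` to a smaller subset that still makes `victim` fail.

    `fails(run)` is a hypothetical callback: it should run the tests in
    `run` in order and return True if the victim test failed.
    """
    tests = list(candidates)
    while len(tests) > 1:
        mid = len(tests) // 2
        first, second = tests[:mid], tests[mid:]
        if fails(first + [victim]):
            tests = first
        elif fails(second + [victim]):
            tests = second
        else:
            # Neither half reproduces the failure on its own, so it either
            # needs tests from both halves or is timing-related. Stop here
            # and return what is left for manual inspection.
            break
    return tests


# Stand-in for a real test run: pretend "test_b" leaves bad state behind.
def fake_fails(run: List[str]) -> bool:
    return "test_b" in run and run[-1] == "test_victim"


order = ["test_a", "test_b", "test_c", "test_d"]
print(find_trigger(order, "test_victim", fake_fails))
```

If the failure is really a timing problem, `fails` will give inconsistent answers and a search like this will not converge on anything meaningful, so it is only worth trying once a given test order fails reliably.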
