Title: GitHub Actions CI/CD `Coverage` job is broken on master
msg391373 - (view) Author: Sviatoslav Sydorenko (webknjaz) * Date: 2021-04-19 11:50
I noticed that the coverage job on the last commit on master (at the time of writing) takes suspiciously long to complete.

I did some investigation and noticed that this job on the 3.9 branch succeeds (all of the job runs on the first page in the list are green —

But then I took a look at the runs on master and discovered that the last successful run was 4 months ago —

The last success is and after that, starting with, it fails consistently.

Notably, all of the failures are caused by the job timeout after *6 hours*: the GitHub platform just kills those jobs, since 6h is the default per-job timeout in GHA.

It's also important to mention that before every job started timing out, effectively burning 6 hours of GHA time for each merge and producing no useful reports, there were occasional 6h timeouts, but they weren't consistent.

Looking into the successful runs from the past, on master and other jobs, I haven't noticed any taking more than 1h35m to complete with a successful outcome. Taking this as a baseline, I suggest changing the timeout of the whole job or maybe just the one step that actually runs coverage.

Action items:
* Set job timeout in GHA to 1h40m (allowing a bit of extra time for exceptionally slow jobs) — this will make sure that the failure/timeout is reported sooner than 6h
* Figure out why this started happening in the first place.
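
For the first item, a minimal sketch of what the timeout could look like in a workflow file. The job name, step name, and run command here are hypothetical placeholders, not the actual contents of the coverage workflow; only the `timeout-minutes` key is the point:

```yaml
# Hypothetical workflow fragment; job and step names are illustrative only.
jobs:
  coverage:
    runs-on: ubuntu-latest
    # 1h40m = 100 minutes; without this, the 360-minute (6h) default applies.
    timeout-minutes: 100
    steps:
      - uses: actions/checkout@v2
      - name: Collect coverage
        # Placeholder command, not the real coverage invocation.
        run: echo "coverage collection goes here"
        # Alternative: limit only this step instead of the whole job.
        # timeout-minutes: 100
```

Setting the limit on the job covers setup steps as well, while setting it on the step only bounds the coverage run itself; either way a hung run should be reported well before the 6h mark.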

I'm going to send a PR addressing the first point but feel free to pick up the investigation part — I don't expect to have time for this anytime soon.

P.S. FTR the last timeout of this type happened two months ago —
msg391412 - (view) Author: Brett Cannon (brett.cannon) * (Python committer) Date: 2021-04-19 22:44
It might be time to just kill the coverage report since people are obviously not looking at the results.
msg391455 - (view) Author: miss-islington (miss-islington) Date: 2021-04-20 17:22
New changeset 077a2e76649c2fc49ebb8982b6855bab09a85e8f by Sviatoslav Sydorenko in branch 'master':
bpo-43888: Reduce coverage collection timeout to 1h40m in GHA (GH-25471)
msg391836 - (view) Author: Batuhan Taskaya (BTaskaya) * (Python committer) Date: 2021-04-25 04:33
Coverage runs are still failing on master, and I think we should at least do something like allowing failure; otherwise GitHub will send notifications for this flaky run.
msg391837 - (view) Author: Ammar Askar (ammar2) * (Python committer) Date: 2021-04-25 04:36
For what it's worth I think Brett's suggestion of just removing the coverage build entirely is good too since it seems like no one actually looks at the results and they take up valuable CI time.
msg392514 - (view) Author: Brett Cannon (brett.cannon) * (Python committer) Date: 2021-04-30 21:04
New changeset 726c931b3896dc73fd156e2340b5ef0b8f55cfb7 by Ammar Askar in branch 'master':
bpo-43888: Remove coverage builds from CI (GH-25679)
msg392527 - (view) Author: Ammar Askar (ammar2) * (Python committer) Date: 2021-04-30 22:41
Coverage builds have been removed from the CI, closing this now.
