@mpaolini I don't have much time these weeks. Could you try PR-15591?

I confirmed up to a 4x speedup, but I'm afraid there may be a performance regression in simple cases.
