Author christian.heimes
Date 2018-06-19.08:58:45
I read the patch. The manual loop unrolling is ugly but mostly ok. However, I'm -1 on the "-O0" option and won't accept the patch unless it works with standard optimization. sha3 is already slow. Without optimization, the algorithm grinds to a halt.

Did anybody contact Intel and report the bug? It's clearly a bug in ICC. GCC, clang and MSVC compile the code just fine.
