Is there a way to use SSE when it's available and fall back to plain x86
code when it's not?  IIRC, floating point used to work that way (using a C
library as a fallback on systems without coprocessor support).
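
To make the question concrete, here is a rough sketch of the kind of runtime
dispatch I have in mind. It assumes GCC or Clang (for `__builtin_cpu_supports`
and the `target` attribute); the names add_scalar, add_sse, have_sse and
pick_add are made up for illustration and don't come from any existing library.

```c
#include <stddef.h>

/* Plain C fallback: runs on any x86 (or any other CPU). */
static void add_scalar(const float *a, const float *b, float *out, size_t n)
{
    for (size_t i = 0; i < n; i++)
        out[i] = a[i] + b[i];
}

#if defined(__GNUC__) && (defined(__i386__) || defined(__x86_64__))
#include <xmmintrin.h>

/* SSE version: adds four floats per iteration, then finishes the tail.
 * The target attribute lets this one function use SSE even when the
 * rest of the file is compiled without -msse (GCC/Clang assumption). */
__attribute__((target("sse")))
static void add_sse(const float *a, const float *b, float *out, size_t n)
{
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        __m128 va = _mm_loadu_ps(a + i);   /* unaligned loads */
        __m128 vb = _mm_loadu_ps(b + i);
        _mm_storeu_ps(out + i, _mm_add_ps(va, vb));
    }
    for (; i < n; i++)
        out[i] = a[i] + b[i];
}

/* Ask the CPU at run time whether SSE is present. */
static int have_sse(void)
{
    __builtin_cpu_init();
    return __builtin_cpu_supports("sse") != 0;
}
#else
/* Other compiler or non-x86 target: only the fallback exists. */
#define add_sse add_scalar
static int have_sse(void) { return 0; }
#endif

typedef void (*add_fn)(const float *, const float *, float *, size_t);

/* Pick an implementation once; callers go through the returned pointer. */
static add_fn pick_add(void)
{
    return have_sse() ? add_sse : add_scalar;
}
```

The caller would do `add_fn add = pick_add();` once at startup and then call
`add(a, b, out, n)` everywhere, so the check isn't repeated per call. Whether
that is the recommended way to do it is exactly what I'm asking.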
