musl/src/math, branch v1.2.2

arm fabs and sqrt: support single-precision-only fpu variants

2020-11-29T05:49:24+00:00

math: new software sqrtl

2020-08-06T03:06:01+00:00

same approach as in sqrt.

sqrtl was broken on aarch64, riscv64 and s390x targets because
of missing quad precision support and on m68k-sf because of
missing ld80 sqrtl.

this implementation was written for quad precision and then
edited to work with both m68k and x86 style ld80 formats too,
but it is not expected to be optimal for them.

note: using fp instructions for the initial estimate when such
instructions are available (e.g. double prec sqrt or rsqrt) is
avoided because of fenv correctness.

math: add __math_invalidl

2020-08-06T03:05:57+00:00

for targets where long double is different from double.

math: new software sqrtf

2020-08-06T03:05:36+00:00

same method as in sqrt. this was tested on all inputs against
an sqrtf instruction. (the only difference found was that x86
sqrtf does not signal the x86 specific input-denormal exception
on negative subnormal inputs while the software sqrtf does.
this is fine as it was designed for ieee754 exceptions only.)
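
a minimal sketch of what an exhaustive comparison over all 32-bit
inputs could look like. this is not the harness used for the commit:
the names count_mismatches, twice_add, twice_mul and thrice are
hypothetical, and the stand-in functions take the place of the software
and hardware sqrtf. it compares result bits only. comparing exception
flags, which is where the difference noted above shows up, would also
need <fenv.h>.

```c
#include <stdint.h>
#include <string.h>

/* reinterpret a 32-bit pattern as a float and back, via memcpy so that
   no aliasing rules are violated. */
static float float_from_bits(uint32_t u)
{
	float f;
	memcpy(&f, &u, sizeof f);
	return f;
}

static uint32_t bits_from_float(float f)
{
	uint32_t u;
	memcpy(&u, &f, sizeof u);
	return u;
}

static int is_nan_bits(uint32_t u)
{
	return (u & 0x7fffffff) > 0x7f800000;
}

/* hypothetical helper: count the inputs on which f and g disagree bit
   for bit, treating any nan as equal to any other nan. step == 1 visits
   all 2^32 bit patterns, larger steps only sample them. */
uint64_t count_mismatches(float (*f)(float), float (*g)(float), uint32_t step)
{
	uint64_t bad = 0;
	if (step == 0)
		step = 1;
	for (uint64_t i = 0; i <= UINT32_MAX; i += step) {
		float x = float_from_bits((uint32_t)i);
		uint32_t a = bits_from_float(f(x));
		uint32_t b = bits_from_float(g(x));
		if (a != b && !(is_nan_bits(a) && is_nan_bits(b)))
			bad++;
	}
	return bad;
}

/* stand-ins for the two implementations under comparison. */
float twice_add(float x) { return x + x; }
float twice_mul(float x) { return 2.0f * x; }
float thrice(float x) { return 3.0f * x; }
```

calling count_mismatches(twice_add, twice_mul, 1) walks every bit
pattern. a larger step gives a quick sampled run.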

there is a known faster method:
"Computing Floating-Point Square Roots via Bivariate Polynomial Evaluation"
that computes sqrtf directly via pipelined polynomial evaluation
which allows more parallelism, but the design does not generalize
easily to higher precisions.

math: new software sqrt

2020-08-06T03:05:33+00:00

approximate 1/sqrt(x) and sqrt(x) with goldschmidt iterations.
this is known to be a fast method for computing sqrt, but it is
tricky to get right, so detailed comments were added.
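
a minimal sketch of the goldschmidt iteration, to show the shape of
the method. it is an illustration in plain double arithmetic, not the
integer code from this commit: the name goldschmidt_sqrt, the linear
initial estimate and the fixed iteration count are assumptions of the
sketch, and the result is not guaranteed to be correctly rounded.

```c
#include <float.h>

/* hypothetical helper, not musl's code: goldschmidt square root in
   plain double arithmetic. */
double goldschmidt_sqrt(double x)
{
	if (x != x || x == 0.0 || x > DBL_MAX)
		return x;              /* nan, +-0 and +inf map to themselves */
	if (x < 0.0)
		return (x - x) / 0.0;  /* negative input: nan */

	/* range reduction: x = m * 4^k with m in [0.5, 2), so that
	   sqrt(x) = sqrt(m) * 2^k. scaling by powers of two is exact. */
	double m = x, scale = 1.0;
	while (m >= 2.0) { m *= 0.25; scale *= 2.0; }
	while (m < 0.5)  { m *= 4.0;  scale *= 0.5; }

	/* crude linear estimate r ~ 1/sqrt(m). the coefficients are an
	   assumption of this sketch, chosen so that |1 - m*r*r| < 0.4
	   on [0.5, 2). */
	double r = 1.6 - 0.45 * m;

	double s = m * r;    /* converges to sqrt(m) */
	double h = 0.5 * r;  /* converges to 1/(2*sqrt(m)) */
	for (int i = 0; i < 6; i++) {
		double d = 0.5 - s * h;  /* residual, roughly squared each step */
		s += s * d;
		h += h * d;
	}
	return s * scale;
}
```

both s and h are updated with the same correction factor, so each step
needs only multiplications and additions, and the two updates are
independent of each other. a more accurate initial estimate reduces the
number of iterations needed.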

use a lookup table for the initial estimate. this adds 256 bytes
of rodata, but the table can be shared between sqrt, sqrtf and sqrtl.
this saves one iteration compared to a linear estimate.

this is for soft float targets, but it supports fenv by using a
floating-point operation to get the final result.  the result
is correctly rounded in all rounding modes.  if fenv support is
turned off then the nearest rounded result is computed and the
inexact exception is not signaled.

assumes fast 32-bit integer arithmetic and a fast 32-bit to 64-bit
multiply.
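
a sketch of the kind of primitive this assumption refers to: two 32-bit
operands are multiplied into a 64-bit product and the high half is
kept. the name mul32_hi is hypothetical, and reading the operands as
0.32 fixed-point fractions is an assumption of the sketch.

```c
#include <stdint.h>

/* high 32 bits of the 64-bit product of two 32-bit values. if a and b
   are read as fractions in [0, 1) scaled by 2^32, the result is their
   product in the same format, truncated. */
static inline uint32_t mul32_hi(uint32_t a, uint32_t b)
{
	return (uint32_t)(((uint64_t)a * b) >> 32);
}
```

for example, 0x80000000 stands for 0.5 in this format, and
mul32_hi(0x80000000, 0x80000000) gives 0x40000000, that is 0.25.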

add m68k sqrtl using native instruction

2020-08-03T03:31:51+00:00

this is actually a functional fix at present, since the C sqrtl does
not support ld80 and just wraps double sqrt. once that's fixed it will
just be an optimization.

math: add x86_64 remquol

2020-03-24T20:31:36+00:00

math: move x87-family fmod functions to C with inline asm

2020-03-24T20:31:36+00:00

math: move x87-family remainder functions to C with inline asm

2020-03-24T20:31:36+00:00

math: move x87-family rint functions to C with inline asm

2020-03-24T20:31:36+00:00