Worklist for nfloat #2003

fredrik-johansson · 2024-05-27T05:47:46Z

Before I forget anything

fma and fmma that do the right thing when one term is smaller
polynomial multiplication
- block classical, karatsuba, waksman and maybe strassen with fixed-point arithmetic
- block multimodular
all _ui, _si, _fmpz variants
when the precision is more than a few limbs, have multiplication inspect the limbs to see if there are many trailing zeros -> strip off and do a normal mul (see fredrik-johansson@2b2c8a3)
vec_neg
any other important missing vec functions
inv, div, sqrt, rsqrt
transcendental functions
micro-optimization: consider changing the exponent range and redefining the exponent of zero so that one can check x*y == 0 with EXP(x)+EXP(y) < MIN_EXP (saves branches detecting zeros in multiplication and in dot products)

The text was updated successfully, but these errors were encountered:

albinahlback · 2024-05-27T10:24:45Z

I will just state it here as well: For inverses, I believe for small $n$ the fastest method is via Newton iteration (and based off of GMP, I suppose this extends to all numbers). Hardcoded routines for inverses could be implemented for limb counts that are powers of two, just generalizing mpn_invert_limb.

fredrik-johansson · 2024-05-27T14:38:12Z

There is also the basecase algorithm used by mpfr_divhigh_n_basecase and the variant described here: https://inria.hal.science/hal-04557431v1/document

albinahlback · 2024-05-27T19:44:47Z

There is also the basecase algorithm used by mpfr_divhigh_n_basecase and the variant described here: https://inria.hal.science/hal-04557431v1/document

I saw that one. I'm wondering how a fast mpn_invert joined with Granlund-Möller 2n-by-n division algorithm would compare to Sukop's and Zimmermann's new algorithm.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Worklist for nfloat #2003

Worklist for nfloat #2003

fredrik-johansson commented May 27, 2024 •

edited

Loading

albinahlback commented May 27, 2024 •

edited

Loading

fredrik-johansson commented May 27, 2024

albinahlback commented May 27, 2024

Worklist for nfloat #2003

Worklist for nfloat #2003

Comments

fredrik-johansson commented May 27, 2024 • edited Loading

albinahlback commented May 27, 2024 • edited Loading

fredrik-johansson commented May 27, 2024

albinahlback commented May 27, 2024

fredrik-johansson commented May 27, 2024 •

edited

Loading

albinahlback commented May 27, 2024 •

edited

Loading