# @(#)README    1.2 (ELEFUNT) 09/08/85
-1.  The machine-independent Version 7 math library found in 4.2BSD
     is now "/usr/lib/libom.a".  To compile with those routines use -lom.

K.C. Ng, March 7, 1985, with Z-S. Alex Liu, S. McDonald, P. Tang, W. Kahan.
Revised on 5/10/85, 5/13/85, 6/14/85, 8/20/85, 8/27/85.

******************************************************************************
*  This is a description of the upgraded elementary functions (listed in 1). *
*  Bessel functions (j0, j1, jn, y0, y1, yn), floor, and fabs passed over    *
*  from 4.2BSD without change except perhaps for the way floating point      *
*  exception is signaled on a VAX.  Three lines that contain "errno" in      *
*  erf.c (error function erf, erfc) have been deleted to prevent overriding  *
*  the system "errno".                                                       *
******************************************************************************

0. Total number of files: 40

        IEEE/Makefile   VAX/Makefile    VAX/support.s   erf.c       lgama.c
        IEEE/atan2.c    VAX/argred.s    VAX/tan.s       exp.c       log.c
        IEEE/cabs.c     VAX/atan2.s     acosh.c         exp__E.c    log10.c
        IEEE/cbrt.c     VAX/cbrt.s      asincos.c       expm1.c     log1p.c
        IEEE/support.c  VAX/cabs.s      asinh.c         floor.c     log__L.c
        IEEE/trig.c     VAX/infnan.s    atan.c          j0.c        pow.c
        Makefile        VAX/sincos.s    atanh.c         j1.c        sinh.c
        README          VAX/sqrt.s      cosh.c          jn.c        tanh.c

1. Functions implemented:
    (A). Standard elementary functions (total 22):
        acos(x)                 ... in file  "asincos.c"
        asin(x)                 ... in file  "asincos.c"
        atan(x)                 ... in file  "atan.c"
        atan2(y,x)              ... in files "IEEE/atan2.c", "VAX/atan2.s"
        sin(x)                  ... in files "IEEE/trig.c",  "VAX/sincos.s"
        cos(x)                  ... in files "IEEE/trig.c",  "VAX/sincos.s"
        tan(x)                  ... in files "IEEE/trig.c",  "VAX/tan.s"
        cabs(x,y)               ... in files "IEEE/cabs.c",  "VAX/cabs.s"
        hypot(x,y)              ... in files "IEEE/cabs.c",  "VAX/cabs.s"
        cbrt(x)                 ... in files "IEEE/cbrt.c",  "VAX/cbrt.s"
        exp(x)                  ... in file  "exp.c"
        expm1(x):=exp(x)-1      ... in file  "expm1.c"
        log(x)                  ... in file  "log.c"
        log10(x)                ... in file  "log10.c"
        log1p(x):=log(1+x)      ... in file  "log1p.c"
        pow(x,y)                ... in file  "pow.c"
        sinh(x)                 ... in file  "sinh.c"
        cosh(x)                 ... in file  "cosh.c"
        tanh(x)                 ... in file  "tanh.c"
        asinh(x)                ... in file  "asinh.c"
        acosh(x)                ... in file  "acosh.c"
        atanh(x)                ... in file  "atanh.c"

    (B). Kernel functions:
        exp__E(x,c) ... in file "exp__E.c", used by
                        expm1(), exp(), pow() and cosh()
        log__L(s)   ... in file "log__L.c", used by
                        log1p(), log() and pow()
        libm$argred ... in file "VAX/argred.s", used by VAX version of
                        sin(), cos() and tan()

    (C). System supported functions:
        sqrt()      ... in files "IEEE/support.c", "VAX/sqrt.s"
        drem()      ... in files "IEEE/support.c", "VAX/support.s"
        finite()    ... in files "IEEE/support.c", "VAX/support.s"
        logb()      ... in files "IEEE/support.c", "VAX/support.s"
        scalb()     ... in files "IEEE/support.c", "VAX/support.s"
        copysign()  ... in files "IEEE/support.c", "VAX/support.s"
        rint()      ... in file  "floor.c"


   Notes:
       i. The code in files ending with ".s" is written in VAX assembly
          language and is intended for VAX computers.

          Files that end with ".c" are written in C. They are intended
          for either a VAX or a machine that conforms to the IEEE
          standard 754 for double precision floating-point arithmetic.

      ii. On machines other than VAX or IEEE, use the original math
          library, formerly "/usr/lib/libm.a", now "/usr/lib/libom.a",
          if nothing better is available.

     iii. The trigonometric functions sin(), cos(), tan() and atan2() in files
          "VAX/sincos.s", "VAX/tan.s" and "VAX/atan2.s" are different from
          those in "IEEE/trig.c" and "IEEE/atan2.c".  The VAX assembler code
          uses the true value of pi to perform argument reduction, while the
          C code uses the machine's value PI, that is, true pi rounded (see
          "IEEE/trig.c").


2. A computer system that conforms to IEEE standard 754 should provide
          sqrt(x),
          drem(x,p), (double precision remainder function)
          copysign(x,y),
          finite(x),
          scalb(x,N),
          logb(x) and
          rint(x).
   These functions are either required or recommended by the standard.
   For convenience, a (slow) C implementation of these functions is
   provided in the file "IEEE/support.c".

   Warning: The functions in "IEEE/support.c" are somewhat machine dependent.
   Some modifications may be necessary to run them on a different machine.
   Currently, if compiled with a suitable flag, "IEEE/support.c" will work on a
   National 32000, a Zilog 8000, a VAX, and a SUN (cf. the "Makefile" in
   this directory). Invoke the C compiler thus:

        cc -c -DVAX IEEE/support.c              ... on a VAX, D-format
        cc -c -DNATIONAL IEEE/support.c         ... on a National 32000
        cc -c  IEEE/support.c                   ... on other IEEE machines,
                                                    we hope.

   Notes:
      1. Faster versions of "drem" and "sqrt" for IEEE double precision
         (coded in C but intended for assembly language) are given at the
         end of "IEEE/support.c" but commented out since they require certain
         machine-dependent functions.

      2. A fast VAX assembler version of the system supported functions
         copysign(), logb(), scalb(), finite(), and drem() appears in file
         "VAX/support.s".  A fast VAX assembler version of sqrt() is in
         file "VAX/sqrt.s".

3. Two formats are supported by all the standard elementary functions:
   the VAX D-format (56-bit precision), and the IEEE double format
   (53-bit precision).  The cbrt() in "IEEE/cbrt.c" is for IEEE machines
   only. The functions in files that end with ".s" are for VAX computers
   only. The functions in files that end with ".c" (except "IEEE/cbrt.c") are
   for VAX and IEEE machines. To use the VAX D-format, compile the code
   with -DVAX; to use IEEE double format on various IEEE machines, see
   the "Makefile" in this directory.

    Example:
        cc -c -DVAX sin.c               ... for VAX D-format

       Warning: The values of floating-point constants used in the code are
                given in both hexadecimal and decimal.  The hexadecimal values
                are the intended ones. The decimal values may be used provided
                that the compiler converts from decimal to binary accurately
                enough to produce the hexadecimal values shown. If the
                conversion is inaccurate, then one must know the exact machine
                representation of the constants and alter the assembly
                language output from the compiler, or play tricks like
                the following in a C program.

                        Example: to store the floating-point constant

                             p1= 2^-6 * .F83ABE67E1066A (Hexadecimal)

                        on a VAX in C, we use two longwords to store its
                        machine value and define p1 to be the double constant
                        at the location of these two longwords:

                        static long  p1x[] = {0x3abe3d78, 0x066a67e1};
                        #define      p1      (*(double*)p1x)
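
       The two longwords in the example can be checked on a non-VAX machine
       with a small decoder.  The sketch below is not part of the library;
       vax_d_to_double() is a hypothetical helper that assumes the usual
       D-format layout (four 16-bit words, sign and 8-bit excess-128
       exponent in the first word, hidden leading fraction bit) and an IEEE
       double host, so the 56-bit fraction is rounded to 53 bits.

```c
#include <math.h>
#include <stdint.h>

/*
 * Hypothetical helper: decode a VAX D-format value held in two 32-bit
 * longwords (lo at the lower address) into the host's double.
 * Value = 0.1fff...f (binary, 56 bits with hidden bit) * 2^(exp - 128).
 */
static double vax_d_to_double(uint32_t lo, uint32_t hi)
{
    uint16_t w0 = lo & 0xffffu, w1 = lo >> 16;  /* words in memory order */
    uint16_t w2 = hi & 0xffffu, w3 = hi >> 16;
    int sign = (w0 >> 15) & 1;
    int exp = (w0 >> 7) & 0xff;
    uint64_t frac;
    double v;

    if (exp == 0)
        return 0.0;     /* true zero; reserved operands are not modeled */

    /* Hidden bit plus 7 fraction bits from w0, then the other 48 bits. */
    frac = ((uint64_t)(0x80u | (w0 & 0x7fu)) << 48)
         | ((uint64_t)w1 << 32) | ((uint64_t)w2 << 16) | (uint64_t)w3;

    v = ldexp((double)frac, exp - 128 - 56);
    return sign ? -v : v;
}
```

       Under these assumptions, vax_d_to_double(0x3abe3d78, 0x066a67e1)
       reproduces 2^-6 * .F83ABE67E1066A (hexadecimal), about 0.01515,
       up to the rounding to IEEE double.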

    Note:  On a VAX, some functions have two implementations. For example,
           cabs() has one in "IEEE/cabs.c" and another in "VAX/cabs.s".
           In such cases, the assembly language version is preferred.


4. Accuracy.

            The errors in expm1(), log1p(), exp(), log(), cabs(), hypot()
            and cbrt() are below 1 ULP (Unit in the Last Place).

            The error in pow(x,y) grows with the size of y. Nevertheless,
            for integers x and y, pow(x,y) returns the correct integer value
            on all tested machines (VAX, SUN, NATIONAL, ZILOG), provided that
            x to the power of y is representable exactly.

            cosh(), sinh(), acosh(), asinh(), tanh(), atanh() and log10() have
            errors below about 3 ULPs.

            For trigonometric and inverse trigonometric functions, let
            [trig(x)] denote the value actually computed for trig(x).

                1) Those codes using the machine's value PI (true pi rounded):
                   (in files "IEEE/trig.c", "IEEE/atan2.c", "asincos.c" and
                   "atan.c".)

                   The errors in [sin(x)], [cos(x)], and [atan(x)] are below
                   1 ULP compared with sin(x*pi/PI), cos(x*pi/PI), and
                   atan(x)*PI/pi respectively, where PI is the machine's
                   value of pi rounded. [tan(x)] returns tan(x*pi/PI) within
                   about 2 ULPs; [acos(x)], [asin(x)], and [atan2(y,x)]
                   return acos(x)*PI/pi, asin(x)*PI/pi, and atan2(y,x)*PI/pi
                   respectively to similar accuracy.

                2) Those using true pi (for VAX D-format only):
                   (in files "VAX/sincos.s", "VAX/tan.s", "VAX/atan2.s",
                   "asincos.c" and "atan.c".)

                   The errors in [sin(x)], [cos(x)], and [atan(x)] are below
                   1 ULP.  [tan(x)], [atan2(y,x)], [acos(x)], and [asin(x)]
                   have errors below about 2 ULPs.

            Here are the results of some test runs to find worst errors on
            the VAX :

    tan   :  2.09 ULPs          ...1,024,000 random arguments (machine PI)
    sin   :  .861 ULPs          ...1,024,000 random arguments (machine PI)
    cos   :  .857 ULPs          ...1,024,000 random arguments (machine PI)
    (compared with tan, sin, cos of (x*pi/PI))

    acos  :  2.07 ULPs          .....200,000 random arguments (machine PI)
    asin  :  2.06 ULPs          .....200,000 random arguments (machine PI)
    atan2 :  1.41 ULPs          .....356,000 random arguments (machine PI)
    atan  :  0.86 ULPs          ...1,536,000 random arguments (machine PI)
    (compared with (PI/pi)*(atan, asin, acos, atan2 of x))

    tan   :  2.15 ULPs          ...1,024,000 random arguments (true pi)
    sin   :  .814 ULPs          ...1,024,000 random arguments (true pi)
    cos   :  .792 ULPs          ...1,024,000 random arguments (true pi)
    acos  :  2.15 ULPs          ...1,024,000 random arguments (true pi)
    asin  :  1.99 ULPs          ...1,024,000 random arguments (true pi)
    atan2 :  1.48 ULPs          ...1,024,000 random arguments (true pi)
    atan  :  .850 ULPs          ...1,024,000 random arguments (true pi)

    acosh :  3.30 ULPs          .....512,000 random arguments
    asinh :  1.58 ULPs          .....512,000 random arguments
    atanh :  1.71 ULPs          .....512,000 random arguments
    cosh  :  1.23 ULPs          .....768,000 random arguments
    sinh  :  1.93 ULPs          ...1,024,000 random arguments
    tanh  :  2.22 ULPs          ...1,024,000 random arguments
    log10 :  1.74 ULPs          ...1,536,000 random arguments
    pow   :  1.79 ULPs          .....100,000 random arguments, 0 < x, y < 20.

    exp   :  .768 ULPs          ...1,156,000 random arguments
    expm1 :  .844 ULPs          ...1,166,000 random arguments
    log1p :  .846 ULPs          ...1,536,000 random arguments
    log   :  .826 ULPs          ...1,536,000 random arguments
    cabs  :  .959 ULPs          .....500,000 random arguments
    cbrt  :  .666 ULPs          ...5,120,000 random arguments


5. Speed.

        Some functions coded in VAX assembly language (cabs(), hypot() and
        sqrt()) are significantly faster than the corresponding ones in 4.2BSD.
        In general, to improve performance, all functions in "IEEE/support.c"
        should be written in assembly language and, whenever possible, should
        be called via short subroutine calls.


6. j0,j1,jn.

        The modifications to these routines were only in how an invalid
        floating point operation is signaled on a VAX.


7. Copyright notice, and Disclaimer:

***************************************************************************
*                                                                         *
* Copyright (c) 1985 Regents of the University of California.             *
*                                                                         *
* Use and reproduction of this software are granted  in  accordance  with *
* the terms and conditions specified in  the  Berkeley  Software  License *
* Agreement (in particular, this entails acknowledgement of the programs' *
* source, and inclusion of this notice) with the additional understanding *
* that  all  recipients  should regard themselves as participants  in  an *
* ongoing  research  project and hence should  feel  obligated  to report *
* their  experiences (good or bad) with these elementary function  codes, *
* using "sendbug 4bsd-bugs@BERKELEY", to the authors.                     *
*                                                                         *
***************************************************************************