# @(#)README	1.1 (ELEFUNT) 09/06/85
-1.  The machine-independent Version 7 math library found in 4.2BSD
     is now /usr/lib/libom.a.  To compile with those routines use -lom.

K.C. Ng, March 7, 1985, with Z-S. Alex Liu, S. McDonald, P. Tang, W. Kahan.
Revised on 5/10/85, 5/13/85, 6/14/85, 8/20/85, 8/27/85.

******************************************************************************
*  This is a description of the upgraded elementary functions (listed in 1). *
*  Bessel functions (j0, j1, jn, y0, y1, yn), floor, and fabs were carried   *
*  over from 4.2BSD without change except perhaps for the way a floating     *
*  point exception is signaled on a VAX.  Three lines that contain "errno"   *
*  in erf.c (error function erf, erfc) have been deleted to prevent          *
*  overriding the system "errno".                                            *
******************************************************************************

0. Total number of files: 40

        IEEE/Makefile   VAX/Makefile    VAX/support.s   erf.c       lgama.c
        IEEE/atan2.c    VAX/argred.s    VAX/tan.s       exp.c       log.c
        IEEE/cabs.c     VAX/atan2.s     acosh.c         exp__E.c    log10.c
        IEEE/cbrt.c     VAX/cabs.s      asincos.c       expm1.c     log1p.c
        IEEE/support.c  VAX/cbrt.s      asinh.c         floor.c     log__L.c
        IEEE/trig.c     VAX/infnan.s    atan.c          j0.c        pow.c
        Makefile        VAX/sincos.s    atanh.c         j1.c        sinh.c
        README          VAX/sqrt.s      cosh.c          jn.c        tanh.c

1. Functions implemented :
    (A). Standard elementary functions (total 22) :
        acos(x)                 ...in file  asincos.c
        asin(x)                 ...in file  asincos.c
        atan(x)                 ...in file  atan.c
        atan2(y,x)              ...in files IEEE/atan2.c, VAX/atan2.s
        sin(x)                  ...in files IEEE/trig.c , VAX/sincos.s
        cos(x)                  ...in files IEEE/trig.c , VAX/sincos.s
        tan(x)                  ...in files IEEE/trig.c , VAX/tan.s
        cabs(x,y)               ...in files IEEE/cabs.c , VAX/cabs.s
        hypot(x,y)              ...in files IEEE/cabs.c , VAX/cabs.s
        cbrt(x)                 ...in files IEEE/cbrt.c , VAX/cbrt.s
        exp(x)                  ...in file  exp.c
        expm1(x):=exp(x)-1      ...in file  expm1.c
        log(x)                  ...in file  log.c
        log10(x)                ...in file  log10.c
        log1p(x):=log(1+x)      ...in file  log1p.c
        pow(x,y)                ...in file  pow.c
        sinh(x)                 ...in file  sinh.c
        cosh(x)                 ...in file  cosh.c
        tanh(x)                 ...in file  tanh.c
        asinh(x)                ...in file  asinh.c
        acosh(x)                ...in file  acosh.c
        atanh(x)                ...in file  atanh.c

    (B). Kernel functions :
        exp__E(x,c) ...in file exp__E.c, used by expm1/exp/pow/cosh
        log__L(s)   ...in file log__L.c, used by log1p/log/pow
        libm$argred ...in file VAX/argred.s, used by VAX/tan.s and VAX/sincos.s

    (C). System supported functions :
        sqrt()      ...in files IEEE/support.c , VAX/sqrt.s
        drem()      ...in files IEEE/support.c , VAX/support.s
        finite()    ...in files IEEE/support.c , VAX/support.s
        logb()      ...in files IEEE/support.c , VAX/support.s
        scalb()     ...in files IEEE/support.c , VAX/support.s
        copysign()  ...in files IEEE/support.c , VAX/support.s
        rint()      ...in file  floor.c


   Notes:
       i. The codes in files ending with .s are written in VAX assembly
          language. They are intended for VAX computers.

          Files that end with .c are written in C. They are intended
          for either a VAX or a machine that conforms to the IEEE
          standard 754 for (double-precision) floating-point arithmetic.

      ii. On machines other than VAX or IEEE, use the original math
          library (formerly libm.a, now libom.a) if nothing better
          is available.

     iii. The trigonometric functions sin/cos/tan/atan2 in files "VAX/sincos.s",
          "VAX/tan.s" and "VAX/atan2.s" are different from those in
          "IEEE/trig.c" and "IEEE/atan2.c".  The VAX assembler code uses the
          true value of pi to perform argument reduction, while the C code uses
          a machine value of PI (see "IEEE/trig.c").


2. A computer system that conforms to IEEE standard 754 should provide
                sqrt(x),
                drem(x,p), (double precision remainder function)
                copysign(x,y),
                finite(x),
                scalb(x,N),
                logb(x) and
                rint(x).
   These functions are required or recommended by the standard.
   For convenience, a (slow) C implementation of these functions is
   provided in the file IEEE/support.c.

   Warning: The functions in IEEE/support.c are somewhat machine dependent.
   Some modifications may be necessary to run them on a different machine.
   Currently, if compiled with a suitable flag, IEEE/support.c will work on a
   National 32000, a Zilog 8000, a VAX, and a SUN (cf. the "Makefile" in
   this directory). Invoke the C compiler thus:

        cc -c -DVAX IEEE/support.c              ... on a VAX, D-format
        cc -c -DNATIONAL IEEE/support.c         ... on a National 32081
        cc -c  IEEE/support.c                   ... on other IEEE machines,
                                                    we hope.

   Notes:
      1. Faster versions of "drem" and "sqrt" for IEEE double precision
         (coded in C but intended for assembly language) are given at the
         end of IEEE/support.c but commented out since they require certain
         machine-dependent functions.

      2. A fast VAX assembler version of the system supported functions
         copysign(), logb(), scalb(), finite(), and drem() appears in file
         VAX/support.s.  A fast VAX assembler version of sqrt() is in
         file VAX/sqrt.s.

3. Two formats are supported by all the standard elementary functions:
   the VAX D-format (56 bits' precision), and the IEEE double format
   (53 bits' precision).  The cbrt() in IEEE/cbrt.c is for IEEE machines
   only. The functions in files that end with ".s" are for VAX computers
   only. The functions in files that end with ".c" (except IEEE/cbrt.c) are
   for VAX and IEEE machines. To use the VAX D-format, compile the code
   with -DVAX; to use IEEE double format on various IEEE machines, see
   the Makefile in this directory.

    Example:
        cc -c -DVAX exp.c               ... for VAX D-format
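
    The -DVAX flag works through ordinary conditional compilation.  The
    fragment below is only a sketch of that mechanism, not an excerpt from
    the library sources: the macro SIGNIFICAND_BITS and the function
    significand_bits() are names invented for the illustration.

```c
/* Select a per-format parameter at compile time, keyed on -DVAX. */
#ifdef VAX
#define SIGNIFICAND_BITS 56     /* VAX D-format */
#else
#define SIGNIFICAND_BITS 53     /* IEEE double format */
#endif

/* Precision, in bits, of the format this file was compiled for. */
int significand_bits(void)
{
    return SIGNIFICAND_BITS;
}
```

    Compiled without -DVAX, significand_bits() returns 53; with -DVAX it
    returns 56.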

       Warning: The values of floating-point constants used in the code are
                given in both hexadecimal and decimal.  The hexadecimal values
                are the intended ones. The decimal values may be used provided
                that the compiler converts from decimal to binary accurately
                enough to produce the hexadecimal values shown. If the
                conversion is inaccurate, then one must know the exact machine
                representation of the constants and alter the
                assembly-language output from the compiler, or apply tricks
                like the following in a C program.

                        Example: to store the floating-point constant

                             p1= 2^-6 * .F83ABE67E1066A (Hexadecimal)

                        on a VAX in C, we use two long words to store its
                        machine value and define p1 to be the double constant
                        at the location of these two long words:

                        static long  p1x[] = { 0x3abe3d78, 0x066a67e1};
                        #define      p1      (*(double*)p1x)
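
    The same trick can be sketched for the IEEE double format.  The code
    below is not from the library: double_from_bits() is a hypothetical
    helper, and it assumes a 64-bit IEEE double stored with the same byte
    order as a 64-bit integer.  It uses memcpy() instead of the pointer cast
    shown above, since the cast is not well defined in later C standards.

```c
#include <stdint.h>
#include <string.h>

/* Build a double from its exact IEEE 754 bit pattern, so the value does
 * not depend on the compiler's decimal-to-binary conversion. */
double double_from_bits(uint64_t bits)
{
    double d;
    memcpy(&d, &bits, sizeof d);    /* copy the 8 bytes unchanged */
    return d;
}
```

    For instance, double_from_bits(0x3FF0000000000000ULL) is 1.0 and
    double_from_bits(0x400921FB54442D18ULL) is pi rounded to double.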

    Note:  On a VAX, some functions have two implementations. For example,
           cabs() has one implementation in IEEE/cabs.c, and another in
           VAX/cabs.s.  In this case, the assembly version is preferred.


4. Accuracy.

            The errors in expm1(), log1p(), exp(), log(), cabs(), hypot()
            and cbrt() are below 1 ULP (Unit in the Last Place).

            The error in pow(x,y) grows with the size of y. Nevertheless,
            for integers x and y, pow(x,y) returns the correct integer value
            on all tested machines (VAX, SUN, NATIONAL, ZILOG), provided that
            x to the power of y is representable exactly.

            cosh, sinh, acosh, asinh, tanh, atanh and log10 have errors below
            about 3 ULPs.

            For trigonometric and inverse trigonometric functions:

                Let [trig(x)] denote the value actually computed for trig(x),

                1) Those codes using the machine's value PI (true pi rounded):
                   (source codes: IEEE/{trig.c,atan2.c}, asincos.c and atan.c)

                   The errors in [sin(x)], [cos(x)], and [atan(x)] are below
                   1 ULP compared with sin(x*pi/PI), cos(x*pi/PI), and
                   atan(x)*PI/pi respectively, where PI is the machine's
                   value of pi rounded. [Tan(x)] returns tan(x*pi/PI) within
                   about 2 ULPs; [acos(x)], [asin(x)], and [atan2(y,x)]
                   return acos(x)*PI/pi, asin(x)*PI/pi, and atan2(y,x)*PI/pi
                   respectively to similar accuracy.


                2) Those using true pi (for VAX D-format only)
                   (source codes: VAX/{sincos.s,tan.s,atan2.s}, asincos.c and
                   atan.c)

                   The errors in [sin(x)], [cos(x)], and [atan(x)] are below
                   1 ULP. [Tan(x)], [atan2(y,x)], [acos(x)], and [asin(x)]
                   have errors below about 2 ULPs.


            Here are the results of some test runs to find worst errors on
            the VAX :

    tan   :  2.09 ULPs          ...1,024,000 random arguments (machine PI)
    sin   :  .861 ULPs          ...1,024,000 random arguments (machine PI)
    cos   :  .857 ULPs          ...1,024,000 random arguments (machine PI)
    (compared with tan, sin, cos of (x*pi/PI))

    acos  :  2.07 ULPs          .....200,000 random arguments (machine PI)
    asin  :  2.06 ULPs          .....200,000 random arguments (machine PI)
    atan2 :  1.41 ULPs          .....356,000 random arguments (machine PI)
    atan  :  0.86 ULPs          ...1,536,000 random arguments (machine PI)
    (compared with (PI/pi)*(atan, asin, acos, atan2 of x))

    tan   :  2.15 ULPs          ...1,024,000 random arguments (true pi)
    sin   :  .814 ULPs          ...1,024,000 random arguments (true pi)
    cos   :  .792 ULPs          ...1,024,000 random arguments (true pi)
    acos  :  2.15 ULPs          ...1,024,000 random arguments (true pi)
    asin  :  1.99 ULPs          ...1,024,000 random arguments (true pi)
    atan2 :  1.48 ULPs          ...1,024,000 random arguments (true pi)
    atan  :  .850 ULPs          ...1,024,000 random arguments (true pi)

    acosh :  3.30 ULPs          .....512,000 random arguments
    asinh :  1.58 ULPs          .....512,000 random arguments
    atanh :  1.71 ULPs          .....512,000 random arguments
    cosh  :  1.23 ULPs          .....768,000 random arguments
    sinh  :  1.93 ULPs          ...1,024,000 random arguments
    tanh  :  2.22 ULPs          ...1,024,000 random arguments
    log10 :  1.74 ULPs          ...1,536,000 random arguments
    pow   :  1.79 ULPs          .....100,000 random arguments, 0 < x, y < 20.

    exp   :  .768 ULPs          ...1,156,000 random arguments
    expm1 :  .844 ULPs          ...1,166,000 random arguments
    log1p :  .846 ULPs          ...1,536,000 random arguments
    log   :  .826 ULPs          ...1,536,000 random arguments
    cabs  :  .959 ULPs          .....500,000 random arguments
    cbrt  :  .666 ULPs          ...5,120,000 random arguments


5. Speed.

        Some functions coded in VAX assembly language (cabs, hypot, sqrt)
        are significantly faster than the corresponding ones in 4.2BSD.
        In general, to improve performance, all functions in IEEE/support.c
        should be written in assembler and, whenever possible, should be
        called via short subroutine calls.


6. j0,j1,jn.

        The modifications to these routines were only in how an invalid
        floating point operation is signaled.


7. Copyright notice, and Disclaimer:

***************************************************************************
*                                                                         *
* Copyright (c) 1985 Regents of the University of California.             *
*                                                                         *
* Use and reproduction of this software are granted  in  accordance  with *
* the terms and conditions specified in  the  Berkeley  Software  License *
* Agreement (in particular, this entails acknowledgement of the programs' *
* source, and inclusion of this notice) with the additional understanding *
* that  all  recipients  should regard themselves as participants  in  an *
* ongoing  research  project and hence should  feel  obligated  to report *
* their  experiences (good or bad) with these elementary function  codes, *
* using "sendbug 4bsd-bugs@BERKELEY", to the authors.                     *
*                                                                         *
***************************************************************************