








               DDCC -- AAnn IInntteerraaccttiivvee DDeesskk CCaallccuullaattoorr


                          _R_o_b_e_r_t _M_o_r_r_i_s
                         _L_o_r_i_n_d_a _C_h_e_r_r_y


                            _A_B_S_T_R_A_C_T

          DC  is  an  interactive  desk  calculator  program
     implemented on the UNIX(R) time-sharing  system  to  do
     arbitrary-precision  integer arithmetic.  It has provi-
     sion for manipulating scaled  fixed-point  numbers  and
     for input and output in bases other than decimal.

          The  size  of  numbers  that can be manipulated is
     limited only by available  core  storage.   On  typical
     implementations  of  UNIX, the size of numbers that can
     be handled varies from several hundred  digits  on  the
     smallest systems to several thousand on the largest.





     _E_d_i_t_o_r_'_s _n_o_t_e_: _t_h_e _d_e_s_c_r_i_p_t_i_o_n _o_f _t_h_e _i_m_p_l_e_m_e_n_t_a_t_i_o_n _d_e_t_a_i_l_s
_o_f _D_C _i_n _t_h_i_s _p_a_p_e_r _i_s _o_n_l_y _v_a_l_i_d _f_o_r _t_h_e _o_r_i_g_i_n_a_l _v_e_r_s_i_o_n _o_f _D_C_.
_T_h_e _c_u_r_r_e_n_t _v_e_r_s_i_o_n _o_f _D_C _u_s_e_s _a _d_i_f_f_e_r_e_n_t _a_p_p_r_o_a_c_h_.

     DC  is an arbitrary precision arithmetic package implemented
on the UNIX time-sharing system in the  form  of  an  interactive
desk  calculator.   It  works  like  a  stacking calculator using
reverse Polish notation.  Ordinarily DC operates on decimal inte-
gers,  but one may specify an input base, output base, and a num-
ber of fractional digits to be maintained.

     A language called BC [1] has been  developed  which  accepts
programs  written  in the familiar style of higher-level program-
ming languages and compiles output which is  interpreted  by  DC.
Some  of  the commands described below were designed for the com-
piler interface and are not easy for a human user to  manipulate.

     Numbers that are typed into DC are put on a push-down stack.
DC commands work by taking the top number or two off  the  stack,
performing  the  desired operation, and pushing the result on the
stack.  If an argument is given, input is taken  from  that  file
until its end, then from the standard input.













USD:5-2                       DC - An Interactive Desk Calculator


SSYYNNOOPPTTIICC DDEESSCCRRIIPPTTIIOONN

     Here  we  describe the DC commands that are intended for use
by people.  The additional  commands  that  are  intended  to  be
invoked by compiled output are described in the detailed descrip-
tion.

     Any number of commands are permitted on a line.  Blanks  and
new-line  characters  are  ignored  except  within numbers and in
places where a register name is expected.

     The following constructions are recognized:

nnuummbbeerr

     The value of the number is pushed onto the  main  stack.   A
     number is an unbroken string of the digits 0-9 and the capi-
     tal letters A-F which are  treated  as  digits  with  values
     10-15 respectively.  The number may be preceded by an under-
     score _ to input a negative  number.   Numbers  may  contain
     decimal points.

++  --  **  %%  ^^

     The  top  two  values on the stack are added (++), subtracted
     (--), multiplied (**), divided (//), remaindered (%%), or  expo-
     nentiated  (^).   The  two entries are popped off the stack;
     the result is pushed on  the  stack  in  their  place.   The
     result  of  a  division is an integer truncated toward zero.
     See the detailed description below for the treatment of num-
     bers  with  decimal  points.   An exponent must not have any
     digits after the decimal point.

ss_x

     The top of the main stack is popped and stored into a regis-
     ter named _x, where _x may be any character.  If the ss is cap-
     italized, _x is treated as a stack and the  value  is  pushed
     onto  it.  Any character, even blank or new-line, is a valid
     register name.

ll_x

     The value in register _x is pushed onto the stack.  The  reg-
     ister _x is not altered.  If the ll is capitalized, register _x
     is treated as a stack and its top value is popped  onto  the
     main stack.

All  registers  start with empty value which is treated as a zero
by the command ll and is treated as an error by the command LL.













DC - An Interactive Desk Calculator                       USD:5-3


dd

     The top value on the stack is duplicated.

pp

     The top value on  the  stack  is  printed.   The  top  value
     remains unchanged.

ff

     All values on the stack and in registers are printed.

xx

     treats  the  top element of the stack as a character string,
     removes it from the stack, and executes it as a string of DC
     commands.

[[ ...... ]]

     puts  the  bracketed  character  string  onto the top of the
     stack.

qq

     exits the program.  If executing  a  string,  the  recursion
     level  is popped by two.  If qq is capitalized, the top value
     on the stack is popped and the  string  execution  level  is
     popped by that value.

<<_x  >>_x  ==_x  !!<<_x  !!>>_x  !!==_x

     The  top  two elements of the stack are popped and compared.
     Register _x is executed if they  obey  the  stated  relation.
     Exclamation point is negation.

vv

     replaces  the  top  element on the stack by its square root.
     The square root of an integer is truncated  to  an  integer.
     For  the  treatment  of numbers with decimal points, see the
     detailed description below.

!!

     interprets the rest of the line as a UNIX command.   Control
     returns to DC when the UNIX command terminates.

cc

     All values on the stack are popped; the stack becomes empty.











USD:5-4                       DC - An Interactive Desk Calculator


ii

     The top value on the stack is popped and used as the  number
     radix  for further input.  If ii is capitalized, the value of
     the input base is pushed onto the stack.  No  mechanism  has
     been  provided  for  the input of arbitrary numbers in bases
     less than 1 or greater than 16.

oo

     The top value on the stack is popped and used as the  number
     radix for further output.  If oo is capitalized, the value of
     the output base is pushed onto the stack.

kk

     The top of the stack is popped, and that value is used as  a
     scale  factor  that  influences the number of decimal places
     that are maintained  during  multiplication,  division,  and
     exponentiation.   The  scale  factor must be greater than or
     equal to zero and less than 100.  If kk is  capitalized,  the
     value of the scale factor is pushed onto the stack.

zz

     The value of the stack level is pushed onto the stack.

??

     A  line of input is taken from the input source (usually the
     console) and executed.

DDEETTAAIILLEEDD DDEESSCCRRIIPPTTIIOONN

IInntteerrnnaall RReepprreesseennttaattiioonn ooff NNuummbbeerrss

     Numbers are stored internally using a dynamic storage  allo-
cator.  Numbers are kept in the form of a string of digits to the
base 100 stored one digit  per  byte  (centennial  digits).   The
string is stored with the low-order digit at the beginning of the
string.  For example, the representation of 157 is  57,1.   After
any arithmetic operation on a number, care is taken that all dig-
its are in the range 0-99 and that  the  number  has  no  leading
zeros.  The number zero is represented by the empty string.

     Negative  numbers  are  represented  in the 100's complement
notation, which is analogous to  two's  complement  notation  for
binary  numbers.   The  high  order digit of a negative number is
always -1 and all other digits are in the range 0-99.  The  digit
preceding the high order -1 digit is never a 99.  The representa-
tion of -157 is 43,98,-1.  We shall call this the canonical  form
of  a  number.   The  advantage of this kind of representation of
negative numbers is ease of addition.  When addition is performed
digit  by digit, the result is formally correct.  The result need









DC - An Interactive Desk Calculator                       USD:5-5


only be modified, if necessary, to put it into canonical form.

     Because the largest valid digit is 99 and the byte can  hold
numbers  twice  that  large,  addition can be carried out and the
handling of carries done later when that  is  convenient,  as  it
sometimes is.

     An  additional  byte  is  stored with each number beyond the
high order digit to indicate the number of assumed decimal digits
after the decimal point.  The representation of .001 is 1,_3 where
the scale has been italicized to emphasize the fact  that  it  is
not the high order digit.  The value of this extra byte is called
the ssccaallee ffaaccttoorr of the number.

TThhee AAllllooccaattoorr

     DC uses a dynamic string storage allocator for  all  of  its
internal  storage.  All reading and writing of numbers internally
is done through the allocator.  Associated with  each  string  in
the  allocator  is  a four-word header containing pointers to the
beginning of the string, the end of the string, the next place to
write,  and  the  next  place to read.  Communication between the
allocator and DC is done via pointers to these headers.

     The allocator initially has one large string on  a  list  of
free strings.  All headers except the one pointing to this string
are on a list of free headers.  Requests for strings are made  by
size.   The  size  of  the  string  actually supplied is the next
higher power of 2.  When a request for  a  string  is  made,  the
allocator  first checks the free list to see if there is a string
of the desired size.  If none is found, the allocator  finds  the
next  larger  free string and splits it repeatedly until it has a
string of the right size.  Left-over strings are put on the  free
list.   If  there  are  no larger strings, the allocator tries to
coalesce smaller  free  strings  into  larger  ones.   Since  all
strings  are  the  result of splitting large strings, each string
has a neighbor that is next to it in core and, if  free,  can  be
combined  with  it  to  make  a string twice as long.  This is an
implementation of the `buddy system' of allocation  described  in
[2].

     Failing to find a string of the proper length after coalesc-
ing, the allocator asks the system for more space.  The amount of
space on the system is the only limitation on the size and number
of strings in DC.  If at any time in the  process  of  trying  to
allocate  a  string,  the  allocator runs out of headers, it also
asks the system for more space.

     There are routines in the allocator  for  reading,  writing,
copying,  rewinding,  forward-spacing,  and  backspacing strings.
All string manipulation is done using these routines.

     The reading and writing routines increment the read  pointer
or  write  pointer so that the characters of a string are read or









USD:5-6                       DC - An Interactive Desk Calculator


written in succession by a series of read or  write  calls.   The
write  pointer  is interpreted as the end of the information-con-
taining portion of a string and a call to read beyond that  point
returns  an end-of-string indication.  An attempt to write beyond
the end of a string causes the allocator  to  allocate  a  larger
space and then copy the old string into the larger block.

IInntteerrnnaall AArriitthhmmeettiicc

     All  arithmetic  operations are done on integers.  The oper-
ands (or operand) needed for the operation are  popped  from  the
main stack and their scale factors stripped off.  Zeros are added
or digits removed as necessary to get a  properly  scaled  result
from  the internal arithmetic routine.  For example, if the scale
of the operands is different and decimal alignment  is  required,
as it is for addition, zeros are appended to the operand with the
smaller scale.  After performing the required  arithmetic  opera-
tion,  the proper scale factor is appended to the end of the num-
ber before it is pushed on the stack.

     A register called ssccaallee plays a part in the results of  most
arithmetic operations.  ssccaallee is the bound on the number of deci-
mal places retained in arithmetic computations.  ssccaallee may be set
to  the  number  on  the top of the stack truncated to an integer
with the kk command.  KK may be used to push the value of ssccaallee  on
the  stack.   ssccaallee  must  be greater than or equal to 0 and less
than 100.  The descriptions of the individual  arithmetic  opera-
tions will include the exact effect of ssccaallee on the computations.

AAddddiittiioonn aanndd SSuubbttrraaccttiioonn

     The scales of the two  numbers  are  compared  and  trailing
zeros  are  supplied  to  the number with the lower scale to give
both numbers the same scale.  The number with the  smaller  scale
is  multiplied by 10 if the difference of the scales is odd.  The
scale of the result is then set to the larger of  the  scales  of
the two operands.

     Subtraction  is  performed by negating the number to be sub-
tracted and proceeding as in addition.

     Finally, the addition is performed digit by digit  from  the
low  order  end of the number.  The carries are propagated in the
usual way.  The resulting number is brought into canonical  form,
which  may  require  stripping  of leading zeros, or for negative
numbers replacing the high-order configuration 99,-1 by the digit
-1.   In any case, digits which are not in the range 0-99 must be
brought into that range, propagating any carries or borrows  that
result.

MMuullttiipplliiccaattiioonn

     The scales are removed from the two operands and saved.  The
operands  are  both  made  positive.   Then   multiplication   is









DC - An Interactive Desk Calculator                       USD:5-7


performed in a digit by digit manner that exactly mimics the hand
method of multiplying.  The first number is  multiplied  by  each
digit  of  the second number, beginning with its low order digit.
The intermediate products are  accumulated  into  a  partial  sum
which  becomes  the  final  product.  The product is put into the
canonical form and its sign is computed from  the  signs  of  the
original operands.

     The  scale  of  the  result  is  set equal to the sum of the
scales of the two operands.  If that scale  is  larger  than  the
internal  register  ssccaallee and also larger than both of the scales
of the two operands, then the scale of the result is set equal to
the largest of these three last quantities.

DDiivviissiioonn

     The  scales  are  removed  from the two operands.  Zeros are
appended or digits removed from the dividend to make the scale of
the result of the integer division equal to the internal quantity
ssccaallee.  The signs are removed and saved.

     Division is performed much as it would be done by hand.  The
difference of the lengths of the two numbers is computed.  If the
divisor is longer than the dividend, zero is returned.  Otherwise
the  top  digit of the divisor is divided into the top two digits
of the dividend.  The result is used as  the  first  (high-order)
digit  of the quotient.  It may turn out be one unit too low, but
if it is, the next trial quotient will be larger than 99 and this
will  be  adjusted at the end of the process.  The trial digit is
multiplied by the divisor and the result subtracted from the div-
idend and the process is repeated to get additional quotient dig-
its until the remaining dividend is smaller than the divisor.  At
the  end,  the  digits of the quotient are put into the canonical
form, with propagation of carry as needed.  The sign is set  from
the sign of the operands.

RReemmaaiinnddeerr

     The  division  routine  is  called and division is performed
exactly as described.  The quantity returned is  the  remains  of
the  dividend  at  the end of the divide process.  Since division
truncates toward zero, remainders have the same sign as the divi-
dend.   The  scale  of the remainder is set to the maximum of the
scale of the dividend and the scale  of  the  quotient  plus  the
scale of the divisor.

SSqquuaarree RRoooott

     The  scale is stripped from the operand.  Zeros are added if
necessary to make the integer result have a  scale  that  is  the
larger  of the internal quantity ssccaallee and the scale of the oper-
and.











USD:5-8                       DC - An Interactive Desk Calculator


     The method used to compute sqrt(y) is Newton's  method  with
successive approximations by the rule

        x sub {n+1} ~=~ half ( x sub n + y over x sub n )

The  initial  guess is found by taking the integer square root of
the top two digits.

EExxppoonneennttiiaattiioonn

     Only exponents with zero scale factor are handled.   If  the
exponent is zero, then the result is 1.  If the exponent is nega-
tive, then it is made positive and the base is divided into  one.
The scale of the base is removed.

     The integer exponent is viewed as a binary number.  The base
is repeatedly squared and the result is obtained as a product  of
those  powers of the base that correspond to the positions of the
one-bits in the binary representation of  the  exponent.   Enough
digits  of the result are removed to make the scale of the result
the same as if the indicated multiplication had been performed.

IInnppuutt CCoonnvveerrssiioonn aanndd BBaassee

     Numbers are converted to the internal representation as they
are read in.  The scale stored with a number is simply the number
of fractional digits input.  Negative numbers  are  indicated  by
preceding  the  number with a __ (an underscore).  The hexadecimal
digits A-F correspond to the numbers 10-15  regardless  of  input
base.   The ii command can be used to change the base of the input
numbers.  This command pops the stack,  truncates  the  resulting
number  to an integer, and uses it as the input base for all fur-
ther input.  The input base is initialized to  10  but  may,  for
example be changed to 8 or 16 to do octal or hexadecimal to deci-
mal conversions.  The command II will push the value of the  input
base on the stack.

OOuuttppuutt CCoommmmaannddss

     The command pp causes the top of the stack to be printed.  It
does not remove the top of the  stack.   All  of  the  stack  and
internal  registers can be output by typing the command ff.  The oo
command can be used to change the output base.  This command uses
the top of the stack, truncated to an integer as the base for all
further output.  The output base in initialized to 10.   It  will
work  correctly  for any base.  The command OO pushes the value of
the output base on the stack.

OOuuttppuutt FFoorrmmaatt aanndd BBaassee

     The input and output bases only affect the interpretation of
numbers  on  input  and output; they have no effect on arithmetic
computations.  Large numbers are output with  70  characters  per
line;  a  \ indicates a continued line.  All choices of input and









DC - An Interactive Desk Calculator                       USD:5-9


output bases work correctly, although not all are useful.  A par-
ticularly  useful  output base is 100000, which has the effect of
grouping digits in fives.  Bases of 8 and 16 can be used for dec-
imal-octal or decimal-hexadecimal conversions.

IInntteerrnnaall RReeggiisstteerrss

     Numbers  or  strings  may be stored in internal registers or
loaded on the stack from registers with the  commands  ss  and  ll.
The command ss_x pops the top of the stack and stores the result in
register xx.  _x can be any character.  ll_x  puts  the  contents  of
register  xx on the top of the stack.  The ll command has no effect
on the contents of  register  _x.   The  ss  command,  however,  is
destructive.

SSttaacckk CCoommmmaannddss

     The  command  cc  clears  the  stack.  The command dd pushes a
duplicate of the number on the top of the  stack  on  the  stack.
The  command zz pushes the stack size on the stack.  The command XX
replaces the number on the top of the stack with its  scale  fac-
tor.   The  command  ZZ  replaces  the  top  of the stack with its
length.

SSuubbrroouuttiinnee DDeeffiinniittiioonnss aanndd CCaallllss

     Enclosing a string in [[ ]] pushes the  ascii  string  on  the
stack.   The  qq  command quits or in executing a string, pops the
recursion levels by two.

IInntteerrnnaall RReeggiisstteerrss -- PPrrooggrraammmmiinngg DDCC

     The load and store commands  together  with  [[  ]]  to  store
strings,  xx  to  execute  and the testing commands `<', `>', `=',
`!<', `!>', `!=' can be  used  to  program  DC.   The  xx  command
assumes the top of the stack is an string of DC commands and exe-
cutes it.  The testing commands compare the top two  elements  on
the  stack  and  if the relation holds, execute the register that
follows the relation.  For example, to print the numbers 0-9,

     [lip1+  si  li10>a]sa
     0si  lax


PPuusshh--DDoowwnn RReeggiisstteerrss aanndd AArrrraayyss

     These commands were designed for used by a compiler, not  by
people.   They  involve push-down registers and arrays.  In addi-
tion to the stack that commands work on, DC can be thought of  as
having  individual stacks for each register.  These registers are
operated on by the commands SS and LL.  SS_x pushes the top value  of
the  main  stack  onto the stack for the register _x.  LL_x pops the
stack for register _x and puts the result on the main stack.   The
commands  ss  and  ll  also  work on registers but not as push-down









USD:5-10                      DC - An Interactive Desk Calculator


stacks.  ll doesn't effect the top of the register  stack,  and  ss
destroys what was there before.

     The  commands  to  work  on arrays are :: and ;;.  ::_x pops the
stack and uses this value as an index into the array _x.  The next
element on the stack is stored at this index in _x.  An index must
be greater than or equal to 0 and less than 2048.  ;;_x is the com-
mand  to  load the main stack from the array _x.  The value on the
top of the stack is the index into the array _x of the value to be
loaded.

MMiisscceellllaanneeoouuss CCoommmmaannddss

     The command !! interprets the rest of the line as a UNIX com-
mand and passes it to UNIX to execute.  One other  compiler  com-
mand  is QQ.  This command uses the top of the stack as the number
of levels of recursion to skip.

DDEESSIIGGNN CCHHOOIICCEESS

     The real reason for the use of a dynamic  storage  allocator
was  that  a  general  purpose  program could be (and in fact has
been) used for a variety of other tasks.  The allocator has  some
value  for  input and for compiling (i.e.  the bracket [...] com-
mands) where it cannot be known in advance how long a string will
be.   The result was that at a modest cost in execution time, all
considerations of string allocation and  sizes  of  strings  were
removed  from the remainder of the program and debugging was made
easier.  The allocation method used wastes approximately  25%  of
available space.

     The  choice  of  100 as a base for internal arithmetic seem-
ingly has no compelling advantage.  Yet the  base  cannot  exceed
127  because  of  hardware  limitations  and at the cost of 5% in
space, debugging was made a great deal easier and decimal  output
was made much faster.

     The  reason for a stack-type arithmetic design was to permit
all DC commands from  addition  to  subroutine  execution  to  be
implemented  in  essentially the same way.  The result was a con-
siderable degree of logical separation of the final program  into
modules with very little communication between modules.

     The  rationale for the lack of interaction between the scale
and the bases was to provide an understandable means of  proceed-
ing after a change of base or scale when numbers had already been
entered.  An earlier implementation which had global  notions  of
scale and base did not work out well.  If the value of ssccaallee were
to be interpreted in the current input or  output  base,  then  a
change of base or scale in the midst of a computation would cause
great confusion in the interpretation of the results.   The  cur-
rent  scheme  has  the  advantage that the value of the input and
output bases are only used for input  and  output,  respectively,
and they are ignored in all other operations.  The value of scale









DC - An Interactive Desk Calculator                      USD:5-11


is not used for any essential purpose by any part of the  program
and  it  is  used  only  to  prevent the number of decimal places
resulting from the arithmetic operations from growing beyond  all
bounds.

     The  design  rationale for the choices for the scales of the
results of arithmetic were that in no case should any significant
digits  be  thrown  away  if,  on  appearances, the user actually
wanted them.  Thus, if the user wants to add the numbers 1.5  and
3.517,  it seemed reasonable to give him the result 5.017 without
requiring  him  to  unnecessarily  specify  his  rather   obvious
requirements for precision.

     On the other hand, multiplication and exponentiation produce
results with many more digits than their operands and  it  seemed
reasonable  to  give as a minimum the number of decimal places in
the operands but not to give more  than  that  number  of  digits
unless  the  user asked for them by specifying a value for ssccaallee.
Square root can be handled in just the same  way  as  multiplica-
tion.   The  operation of division gives arbitrarily many decimal
places and there is simply no way to guess how  many  places  the
user  wants.  In this case only, the user must specify a ssccaallee to
get any decimal places at all.

     The scale of remainder was chosen to  make  it  possible  to
recreate  the  dividend from the quotient and remainder.  This is
easy to implement; no digits are thrown away.

RReeffeerreenncceess

[1]  L. L. Cherry, R. Morris, _B_C _- _A_n _A_r_b_i_t_r_a_r_y  _P_r_e_c_i_s_i_o_n  _D_e_s_k_-
     _C_a_l_c_u_l_a_t_o_r _L_a_n_g_u_a_g_e_.

[2]  K.  C.  Knowlton, _A _F_a_s_t _S_t_o_r_a_g_e _A_l_l_o_c_a_t_o_r_, Comm. ACM 88, pp.
     623-625 (Oct. 1965).

























