suckless.org/1/git/libzahl/file/doc/miscellaneous.tex.gph

  URI:

       miscellaneous.tex - libzahl - big integer library
  HTML git clone git://git.suckless.org/libzahl
   DIR Log
   DIR Files
   DIR Refs
   DIR README
   DIR LICENSE
       ---
       miscellaneous.tex (10583B)
       ---
            1 \chapter{Miscellaneous}
            2 \label{chap:Miscellaneous}
            3 
            4 In this chapter, we will learn some miscellaneous
            5 functions. It might seem counterintuitive to start
            6 with miscellanea, but it is probably a good idea
            7 to read this before arithmetics and more advanced
            8 topics. You may read \secref{sec:Marshalling}
            9 later. Before reading this chapter you should
           10 have read \chapref{chap:Get started}.
           11 
           12 
           13 \vspace{1cm}
           14 \minitoc
           15 
           16 
           17 \newpage
           18 \section{Assignment}
           19 \label{sec:Assignment}
           20 
           21 To be able to do anything useful, we must assign
           22 values to integers. There are three functions for
           23 this: {\tt zseti}, {\tt zsetu}, and {\tt zsets}.
           24 The last letter in the names of these function
           25 describe the data type of the input, `i', `u',
           26 and `s' stand for `integer', `unsigned integer',
           27 and `string`, respectively. These resemble the
           28 rules for the format strings in the family of
           29 {\tt printf}-functions. `Integer' of course refer
           30 to `signed integer'; for integer types in C,
           31 part from {\tt char}, the keyword {\tt signed}
           32 is implicit.
           33 
           34 Consider {\tt zseti},
           35 
           36 \begin{alltt}
           37    \textcolor{c}{z_t two;}
           38    \textcolor{c}{zinit(two);}
           39    zseti(two, 2);
           40 \end{alltt}
           41 
           42 \noindent
           43 assignes {\tt two} the value 2. The data type of
           44 the second parameter of {\tt zseti} is {\tt int64\_t}.
           45 It will accept any integer value in the range
           46 $[-2^{63},~2^{63} - 1] = [-9223372036854775808,~9223372036854775807]$,
           47 independently of the machine.\footnote{{\tt int64\_t}
           48 is defined to be a signed 64-bit integer using two's
           49 complement representation.} If this range so not wide
           50 enough, it may be possible to use {\tt zsetu}. Its
           51 second parameter of the type {\tt uint64\_t}, and thus
           52 its range is $[0,~2^{64} - 1] = [0,~18446744073709551615]$.
           53 If a need negative value is desired, {\tt zsetu} can be
           54 combined with {\tt zneg} \psecref{sec:Sign manipulation}.
           55 
           56 For enormous constants or textual input, {\tt zsets}
           57 can be used. {\tt zsets} will accept any numerical
           58 value encoded in decimal ASCII, that only contain
           59 digits, \emph{not} decimal points, whitespace,
           60 apostrophes, et cetera. However, an optional plus
           61 sign or, for negative numbers, an ASCII minus sign
           62 may be used as the very first character. Note that
           63 a proper UCS minus sign is not supported.
           64 
           65 Using what we have learned so far, and {\tt zstr}
           66 which we will learn about in \secref{sec:String output},
           67 we can construct a simple program that calculates the
           68 sum of a set of numbers.
           69 
           70 \begin{alltt}
           71    \textcolor{c}{#include <stdio.h>
           72    #include <stdlib.h>
           73    #include <zahl.h>}
           74 
           75    int
           76    main(int argc, char *argv[]) \{
           77        z_t sum, temp;
           78        \textcolor{c}{jmp_buf failenv;
           79        char *sbuf, *argv0 = *argv;
           80        if (setjmp(failenv)) \{
           81            zperror(argv0);
           82            return 1;
           83        \}
           84        zsetup(failenv);
           85        zinit(sum);
           86        zinit(term);}
           87        zsetu(sum, 0);
           88        for (argv++; *argv; argv++) \{
           89            zsets(term, *argv);
           90            zadd(sum, sum, term);
           91        \}
           92        \textcolor{c}{printf("\%s\textbackslash{}n", (sbuf = zstr(sum, NULL, 0)));
           93        free(sbuf);
           94        zfree(sum);
           95        zfree(term);
           96        zunsetup();
           97        return 0;}
           98    \}
           99 \end{alltt}
          100 
          101 Another form of assignment available in libzahl is
          102 copy-assignment. This done using {\tt zset}. As
          103 easily observable, {\tt zset} is named like
          104 {\tt zseti}, {\tt zsetu}, and {\tt zsets}, but
          105 without the input-type suffix. The lack of a
          106 input-type suffix means that the input type is
          107 {\tt z\_t}. {\tt zset} copies value of second
          108 parameter into the reference in the first. For
          109 example, if {\tt v}, of the type {\tt z\_t}, has
          110 value 10, then {\tt a} will too after the instruction
          111 
          112 \begin{alltt}
          113    zset(a, v);
          114 \end{alltt}
          115 
          116 {\tt zset} does not necessarily make an exact
          117 copy of the input. If, in the example above, the
          118 {\tt a->alloced} is greater than or equal to
          119 {\tt v->used}, {\tt a->alloced} and {\tt a->chars}
          120 are preserved, of course, the content of
          121 {\tt a->chars} is overridden. If however,
          122 {\tt a->alloced} is less then {\tt v->used},
          123 {\tt a->alloced} is assigned a minimal value at
          124 least as great as {\tt v->used} that is a power
          125 of 2, and {\tt a->chars} is updated accordingly
          126 as described in \secref{sec:Integer structure}.
          127 This of course does not apply if {\tt v} has the
          128 value 0; in such cases {\tt a->sign} is simply
          129 set to 0.
          130 
          131 {\tt zset}, {\tt zseti}, {\tt zsetu}, and
          132 {\tt zsets} require that the output-parameter
          133 has been initialised with {\tt zinit} or an
          134 equally acceptable method as described in
          135 \secref{sec:Create an integer}.
          136 
          137 {\tt zset} is often unnecessary, of course
          138 there are cases where it is needed. In some case
          139 {\tt zswap} is enough, and advantageous.
          140 {\tt zswap} is defined as
          141 
          142 \begin{alltt}
          143    \textcolor{c}{static inline} void
          144    zswap(z_t a, z_t b)
          145    \{
          146        z_t t;
          147        *t = *a;
          148        *a = *b;
          149        *b = *t;
          150    \}
          151 \end{alltt}
          152 
          153 \noindent
          154 however its implementation is optimised to be
          155 around three times as fast. It just swaps the members
          156 of the parameters, and thereby the values. There
          157 is no rewriting of {\tt .chars} involved; thus
          158 it runs in constant time. It also does not
          159 require that any argument has been initialised.
          160 After the call, {\tt a} will be initialised
          161 if and only if {\tt b} was initialised, and
          162 vice versa.
          163 
          164 
          165 \newpage
          166 \section{String output}
          167 \label{sec:String output}
          168 
          169 Few useful things can be done without creating
          170 textual output of calculations. To convert a
          171 {\tt z\_t} to ASCII string in decimal, we use the
          172 function {\tt zstr}, declared as
          173 
          174 \begin{alltt}
          175    char *zstr(z_t a, char *buf, size_t n);
          176 \end{alltt}
          177 
          178 \noindent
          179 {\tt zstr} will store the string it creates into
          180 {\tt buf} and return {\tt buf}. However, if {\tt buf}
          181 is {\tt NULL}, a new memory segment is allocated
          182 and returned. {\tt n} should be at least the length
          183 of the resulting string sans NUL termiantion, but
          184 not larger than the allocation size of {\tt buf}
          185 minus 1 byte for NUL termiantion. If {\tt buf} is
          186 {\tt NULL}, {\tt n} may be 0. However if {\tt buf}
          187 is not {\tt NULL}, it is unsafe to let {\tt n} be
          188 0, unless {\tt buf} has been allocated by {\tt zstr}
          189 for a value of {\tt a} at least as larger as the
          190 value of {\tt a} in the new call to {\tt zstr}.
          191 Combining non-\texttt{NULL} {\tt buf} with 0 {\tt n}
          192 is unsafe because {\tt zstr} will use a very fast
          193 formula for calculating a value that is at least
          194 as large as the resulting output length, rather
          195 than the exact length.
          196 
          197 The length of the string output by {\tt zstr} can
          198 be predicted by {\tt zstr\_length}, decleared as
          199 
          200 \begin{alltt}
          201    size_t zstr_length(z_t a, unsigned long long int radix);
          202 \end{alltt}
          203 
          204 \noindent
          205 It will calculated the length of {\tt a} represented
          206 in radix {\tt radix}, sans NUL termination. If
          207 {\tt radix} is 10, the length for a decimal
          208 representation is calculated.
          209 
          210 Sometimes it is possible to never allocate a {\tt buf}
          211 for {\tt zstr}. For example, in an implementation
          212 of {\tt factor}, you can reuse the string of the
          213 value to factorise, since all of its factors are
          214 guaranteed to be no longer than the factored value.
          215 
          216 \begin{alltt}
          217    void
          218    factor(char *value)
          219    \{
          220        size_t n = strlen(value);
          221        z_t product, factor;
          222        zsets(product, value);
          223        printf("\%s:", value);
          224        while (next_factor(product, factor))
          225            printf(" \%s", zstr(factor, value, n));
          226        printf("\verb|\|n");
          227    \}
          228 \end{alltt}
          229 
          230 Other times it is possible to allocate just
          231 once, for example of creating a sorted output.
          232 In such cases, the allocation can be done almost
          233 transparently.
          234 
          235 \begin{alltt}
          236    void
          237    output_presorted_decending(z_t *list, size_t n)
          238    \{
          239        char *buf = NULL;
          240        while (n--)
          241            printf("\%s\verb|\|n", (buf = zstr(*list++, buf, 0)));
          242    \}
          243 \end{alltt}
          244 
          245 \noindent
          246 Note, this example assumes that all values are
          247 non-negative.
          248 
          249 
          250 
          251 \newpage
          252 \section{Comparison}
          253 \label{sec:Comparison}
          254 
          255 libzahl defines four functions for comparing
          256 integers: {\tt zcmp}, {\tt zcmpi}, {\tt zcmpu},
          257 and {\tt zcmpmag}. These follow the same naming
          258 convention as {\tt zset}, {\tt zseti}, and
          259 {\tt zsetu}, as described in \secref{sec:Assignment}.
          260 {\tt zcmpmag} compares the absolute value, the
          261 magnitude, rather than the proper value. These
          262 functions are declared as
          263 
          264 \begin{alltt}
          265    int zcmp(z_t a, z_t b);
          266    int zcmpi(z_t a, int64_t b);
          267    int zcmpu(z_t a, uint64_t b);
          268    int zcmpmag(z_t a, z_t b);
          269 \end{alltt}
          270 
          271 \noindent
          272 They behave similar to {\tt memcmp} and
          273 {\tt strcmp}.\footnote{And {\tt wmemcmp} and
          274 {\tt wcscmp} if you are into that mess.}
          275 The return value is defined
          276 
          277 \vspace{1em}
          278 \(
          279     \mbox{sgn}(a - b) =
          280     \left \lbrace \begin{array}{rl}
          281         -1 & \quad \textrm{if}~a < b \\
          282          0 & \quad \textrm{if}~a = b \\
          283         +1 & \quad \textrm{if}~a > b
          284     \end{array} \right .
          285 \)
          286 \vspace{1em}
          287 
          288 \noindent
          289 for {\tt zcmp}, {\tt zcmpi}, and {\tt zcmpu}.
          290 The return for {\tt zcmpmag} value is defined
          291 
          292 \vspace{1em}
          293 \(
          294     \mbox{sgn}(\lvert a \rvert - \lvert b \rvert) =
          295     \left \lbrace \begin{array}{rl}
          296         -1 & \quad \textrm{if}~\lvert a \rvert < \lvert b \rvert \\
          297          0 & \quad \textrm{if}~\lvert a \rvert = \lvert b \rvert \\
          298         +1 & \quad \textrm{if}~\lvert a \rvert > \lvert b \rvert
          299     \end{array} \right .
          300 \)
          301 \vspace{1em}
          302 
          303 \noindent
          304 It is discouraged, stylistically, to compare against
          305 $-1$ and $+1$, rather, you should always compare
          306 against $0$. Think of it as returning $a - b$, or
          307 $\lvert a \rvert - \lvert b \rvert$ in the case of
          308 {\tt zcmpmag}.
          309 
          310 
          311 \newpage
          312 \section{Marshalling}
          313 \label{sec:Marshalling}
          314 
          315 libzahl is designed to provide efficient communication
          316 for multi-processes applications, including running on
          317 multiple nodes on a cluster computer. However, these
          318 facilities require that it is known that all processes
          319 run the same version of libzahl, and run on compatible
          320 microarchitectures, that is, the processors must have
          321 endianness, and the intrinsic integer types in C must
          322 have the same widths on all processors. When this is not
          323 the case, string conversion can be used (see \secref{sec:Assignment}
          324 and \secref{sec:String output}), but when it is the case
          325 {\tt zsave} and {\tt zload} can be used. {\tt zsave} and
          326 {\tt zload} are declared as
          327 
          328 \begin{alltt}
          329    size_t zsave(z_t a, char *buf);
          330    size_t zload(z_t a, const char *buf);
          331 \end{alltt}
          332 
          333 \noindent
          334 {\tt zsave} stores a version- and microarchitecture-dependent
          335 binary representation of {\tt a} in {\tt buf}, and returns
          336 the number of bytes written to {\tt buf}. If {\tt buf} is
          337 {\tt NULL}, the numbers bytes that would have be written is
          338 returned. {\tt zload} unmarshals an integers from {\tt buf},
          339 created with {\tt zsave}, into {\tt a}, and returns the number
          340 of read bytes. {\tt zload} returns the value returned by
          341 {\tt zsave}.