Project

General

Profile

Bug #888

Updated by Philippe Proulx almost 9 years ago

In section 4.1.5 (Integers), examples should be written to explain how integers are packed one after the other depending on the byte order, because it is not obvious with only the current text. 

 Here's an example for both big-endian and little-endian situations. 

 h3. Big-endian 

 Big-endian words are packed from left to right within a sequence of bytes. 

 Let's pack the following words, in this order, each one having a 1-bit alignment (each individual letter represents a bit, the first one being the most significant): 

 * @ABCDEFGHIJ@ (10 bits) 
 * @klmno@ (5 bits) 
 * @PQRS@ (4 bits) 
 * @tuvwxyz@ (7 bits) 

 Starting at a byte offset, here are the steps (@#@ are eventual bits of padding, or bits that belong to other, eventual fields): 

 <pre> 
 ABCDEFGH IJ###### 
 ABCDEFGH IJklmno# 
 ABCDEFGH IJklmnoP QRS##### 
 ABCDEFGH IJklmnoP QRStuvwx yz###### 
 </pre> 

 h3. Little-endian 

 Little-endian words have less straightforward packing rules. 

 Always start with the least significant bits of the word to pack, and, from right to left within a given byte (LSBs to MSBs), put as many bits as you can of said word's least significant bits, in the usual order (MSBs of the word's substring are still on the right in relation to the LSBs). 

 Let's take the same words as the big-endian example above. Starting at a byte offset, here are the steps: 

 <pre> 
 CDEFGHIJ ######AB 
 CDEFGHIJ #klmnoAB 
 CDEFGHIJ SklmnoAB #####PQR 
 CDEFGHIJ SklmnoAB vwxyzPQR ######tu 
 </pre> 

 h3. Bit position 

 In both examples, at the end, the current bit position is 26. Only, in big-endian we are travelling from left to right within the sequence of bytes, whereas in little-endian, we are travelling from right to left within each byte of the sequence: 

 Big-endian: 

 <pre> 
 ######## 
 ^ ? 
 0 

 ABCDEFGH IJ###### 
 -------> ->^ ???????? ??? 
            10 

 ABCDEFGH IJklmno# 
            ---->^ ?????? 
                 15 

 ABCDEFGH IJklmnoP QRS##### 
                 > -->^ ? ???? 
                      19 

 ABCDEFGH IJklmnoP QRStuvwx yz###### 
                      ----> ->^ ????? ??? 
                              26 
 </pre> 

 Little-endian: 

 <pre> 
 ######## 
        ^ ? 
        0 

 CDEFGHIJ ######AB 
 <------- ????????        ^<- ??? 
               10 

 CDEFGHIJ #klmnoAB 
          ^<---- ?????? 
          15 

 CDEFGHIJ SklmnoAB #####PQR 
          < ?              ^<-- ???? 
                       19 

 CDEFGHIJ SklmnoAB vwxyzPQR ######tu 
                   <---- ?????           ^<- ??? 
                                 26 
 </pre> 

 Aligning fields within bytes is easy when we understand this bit position direction. For example, if we want to align the second word (@klmno@) at multiples of 4, here's the result in both byte orders: 

 Big-endian: 

 <pre> 
 ABCDEFGH IJ##klmn oPQRStuv wxyz#### 
                                ^ ? 
                                28 
 </pre> 

 Little-endian: 

 <pre> 
 CDEFGHIJ lmno##AB xyzPQRSk ####tuvw 
                               ^ ? 
                               28 
 </pre> 

 %{color:red}This is also why packing words which have different byte orders within one byte is **undefined**, and the specification should explicitly tell so as well.% 


 h3. Floating point numbers 

 Floating point numbers obey to the same rules. A CTF floating point number, in its binary form, is a word with a length equal to the sum of its mantissa and exponent lengths. 

 Consider the 1-bit aligned integer 347 (word @101011011@), followed by -3.1415927 represented on an 8-bit exponent, 24-bit mantissa, 4-bit aligned floating point number (word @11000000010010010000111111011011@): 

 Big-endian: 

 <pre> 
 ######## 
 ^ ? 
 0 

 10101101 1####### 
           ^ ? 
           9 

 10101101 1###1100 00000100 10010000 11111101 1011#### 
                                                  ^ ? 
                                                  44 
 </pre> 

 Little-endian: 

 <pre> 
 ######## 
        ^ ? 
        0 

 01011011 #######1 
                ^ ? 
                9 

 01011011 1011###1 11111101 10010000 00000100 ####1100 
                                                 ^ ? 
                                                 44 
 </pre> 

 h3. Strings 

 The packing rules do not apply to CTF strings since their characters (always 8-bit words) must be aligned on multiples of 8.

Back