Lexicodes

In order to compute $\ensuremath{f_{\mbox{\small lexi}}}$ we will first transform the generator matrix of the code into a minimal span generator matrix (MSGM) form, in which the sum of the spans of the generators is minimized. Adapting the notation in [43], the span of a binary n-vector x = (x₁, x₂, x₃,..., x_n) is $\ensuremath{\operatorname{R}}(x)-\ensuremath{\operatorname{L}}(x)$ , where $\ensuremath{\operatorname{R}}(\cdot)$ and $\ensuremath{\operatorname{L}}(\cdot)$ are the rightmost (i.e. largest) and leftmost (i.e. smallest) index i, respectively, such that x_i $\not=$ 0. Thus, the span of the vector x = (0001001100) is $\ensuremath{\operatorname{R}}(x)-\ensuremath{\operatorname{L}}(x)=8-4=4$ . We can efficiently transform any matrix into MSGM-form using the greedy algorithm in [43]. We note that two vectors of an MSGM cannot have their leftmost or their rightmost index in common, or else they may be added to produce a generator matrix with shorter span.

Given an MSGM for a code and a set of coset representatives $\mathcal {V}$ , Method 2.2 computes the lexicographically earliest vector among the cosets represented in $\mathcal {V}$ .

Method 2..2 Consider a set of vectors $\mathcal {V}$ representing cosets of a code $\mathbb {C}$ with length n. Let G be an MSGM for $\ensuremath{\mathbb{C}}$ whose generators are in lexicographically increasing order. The following greedy method computes the lexicographically earliest vector among the represented cosets in time O(n| V|) and space O(1).

This method looks for the generators whose left-most 1-bit corresponds to 1-bits in vectors v $\mbox{$\,\inn\,$}$ V. Figure 2.4 demonstrates this method with the set $\mathcal {V}$ = {1010000} on the (7, 4, 3) code described in Figure 2.2. For this case, the vector 0001011 is the lexicographically earliest vector in the same coset as 1110000. Note that the ordering of the generators in the MSGM is significant and that different orderings might not yield the lexicographically earliest vector. Thus, if generators G₂ and G₃ are switched, then Method 2.2 yields 0010010, which is not the lexicographically earliest vector desired.

Figure: Method 2.2 applied to the code in Figure 2.2 with initial condition $\mathcal {V}$ = {1110000}. The various values taken on by v during the computation are shown on the right-hand region.

G₁ =	0000111
G₂ =	0011110
G₃ =	0110100
G₄ =	1001000

v:	1110000
v $\leftarrow$ v + G₄:	0111000
v $\leftarrow$ v + G₃:	0001100
v $\leftarrow$ v + G₁:	0001011

We will now prove the correctness and complexity bounds of Method 2.2. In our applications, $\mathcal {V}$ will typically be the set of coset leaders with maximum distance from $\ensuremath{\mathbb{C}}$ , in which case Method 2.2 computes precisely $\ensuremath{f_{\mbox{\small lexi}}}(\ensuremath{\mathbb{C}})$ .

Proof of Method 2.2. We first show correctness of the method by proving that, for each v $\mbox{$\,\inn\,$}$ $\mathcal {V}$ , lines 2-5 compute the lexicographically earliest vector in the same coset as v. This way, line 6 in the method will return the lexicographically earliest vector among all the cosets represented.

We know from [43, Thm 6.11 and Lemma 6.7] that the rows of G have the predictable support property:

span $\displaystyle \left(\vphantom{\sum_{j \mbox{$\,\inn\,$}J} G_j }\right.$ $\displaystyle \sum_{{j \mbox{$\,\inn\,$}J}}^{}$ G_j $\displaystyle \left.\vphantom{\sum_{j \mbox{$\,\inn\,$}J} G_j }\right)$ = $\displaystyle \bigcup_{{j \mbox{$\,\inn\,$}J}}^{}$ span(G_j)

(9)

Now suppose that v_best is the lexicographically earliest vector in the same coset as v, but that instead v_stored is the vector stored on line 5 of the method. Our goal will be to show that the difference between these two vectors, denoted v_diff, is in fact 0.

Clearly v, v_best, and v_stored are necessarily in the same coset of $\ensuremath{\mathbb{C}}$ . Moreover, since v and v_stored are in the same coset, v_diff must be a code word of $\ensuremath{\mathbb{C}}$ so that we can write (for an appropriate set J_diff):

Assume (for sake of contradiction) that L(v_diff )= k, meaning that the left-most 1 bit of v_diff occurs at the k-th index. This implies that, starting from the left-most bit, v_best and v_stored are identical until the k-th index, on which they differ. Since, by definition, v_best comes lexicographically before v_stored, it must be that v_best has a 0 and v_stored a 1 at the k-th index. However, (2.5) then implies that there must be some generator vector G_j whose left-index is k, contradicting our constructive generation of v_stored. Thus, the left-index of v_diff could not possibly be at location k, for any k, so that v_diff must be 0 and correctness of the method is proved.

The space bound for this method follows straightforwardly, as at any point in the algorithm we need to actively store only the lexicographically earliest vector among those determined on line 5 so far. The time bound follows from the fact that lines 3-5 are each computable in constant time, giving a running time of O(n| V|) for lines 1-5; Line 6 can be computed with constant overhead on-line, as each vector is stored.

width4pt depth2pt height6pt

Method 2.2 provides a fast way of computing the lexicographic generating mapping. Appendix A.1 gives computed parameters for many lexicodes that were thus computed. This list is more exhaustive than what is currently available in the literature [9,14].