Matrices mod p, The Size of O

The Size of O

Consider the orthonormal matrices over F with a predefined top row r. Let S be one such matrix, the standard matrix. Premultiply S by any matrix that is 1 in the upper left, 0 elsewhere in the first row and column, and orthonormal in its lower right block of size n-1. The top row is still r, and the resulting matrix is orthonormal. Distinct blocks give distinct products, since S is invertible, so this builds an injective map from orthonormal matrices of dimension n-1 into orthonormal matrices of dimension n with a top row of r.

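Next, reverse this map to show that it is a bijection. Take any orthonormal matrix M with top row r and multiply on the right by S inverse (also known as S transpose). The result is orthonormal, and its top row is r dotted with each row of S, which is 1 followed by zeros. Every other row is orthogonal to this top row, so its first entry is 0. Thus M is the product of S with a matrix that is 1 in the upper left and orthonormal in its lower right block. Therefore the number of matrices in O with top row r is equal to the number of orthonormal matrices of dimension n-1. Of course this assumes a standard matrix S exists.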
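The counting argument can be checked by brute force on a small case. Here is a minimal Python sketch, assuming F is the integers mod a small odd prime p, and counting every orthonormal matrix with no condition on the determinant. The function names are invented for this illustration.

```python
from itertools import product

def orthonormal_matrices(n, p):
    """Yield every n x n matrix over the integers mod p whose rows are orthonormal."""
    vectors = product(range(p), repeat=n)
    units = [v for v in vectors if sum(x * x for x in v) % p == 1]

    def extend(rows):
        # Add one more unit row that is orthogonal to all rows chosen so far.
        if len(rows) == n:
            yield tuple(rows)
            return
        for v in units:
            if all(sum(a * b for a, b in zip(v, r)) % p == 0 for r in rows):
                yield from extend(rows + [v])

    yield from extend([])

def count_by_top_row(n, p):
    """Map each possible top row r to the number of orthonormal matrices starting with r."""
    counts = {}
    for m in orthonormal_matrices(n, p):
        counts[m[0]] = counts.get(m[0], 0) + 1
    return counts

p = 3
per_row = count_by_top_row(3, p)
smaller = sum(1 for _ in orthonormal_matrices(2, p))
print(sorted(set(per_row.values())), smaller)
```

For p = 3 every top row of length 1 should carry the same number of 3 by 3 matrices, and that number should match the count of 2 by 2 orthonormal matrices.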

Look at the case n = 3 over a finite field F of size w. We would like to produce a standard matrix S for each top row r. But how many such rows are there? Remember that the length of r is 1. When the upper left entry is ±1, the other two have squares that sum to 0. There are 2w-1 or 1 such pairs, so the two choices of sign contribute 4w-2, or 2, as w is 1 or 3 mod 4. For any of the other w-2 values in the upper left, the remaining two squares sum to something nonzero. We showed earlier that sums of squares are distributed evenly across the nonzero entries of F, with w-1 or w+1 pairs for each. This yields w-2 times (w-1 or w+1), as w is 1 or 3 mod 4. Add these up and get w²+w or w²-w, as w is 1 or 3 mod 4. Finally, multiply this expression by the size of O for n = 2, which was determined earlier. The result is w(w-1)(w+1), regardless of w mod 4. But we still have to demonstrate a standard orthonormal matrix S for each row r.
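The count of unit rows is easy to confirm by exhaustive search. The sketch below assumes w is an odd prime, so that F can be modeled by arithmetic mod w; fields of prime power size are not covered. The values used for the size of O in dimension 2 are the ones quoted in this section, and the function names are illustrative.

```python
from itertools import product

def count_vectors(n, p, length):
    """Count vectors in (Z/p)^n whose squares sum to `length` mod p."""
    target = length % p
    return sum(1 for v in product(range(p), repeat=n)
               if sum(x * x for x in v) % p == target)

def predicted_unit_rows(w):
    """w^2 + w when w is 1 mod 4, and w^2 - w when w is 3 mod 4."""
    return w * w + w if w % 4 == 1 else w * w - w

for w in (3, 5, 7, 11, 13):
    rows = count_vectors(3, w, 1)
    o2 = w - 1 if w % 4 == 1 else w + 1  # size of O for n = 2, taken from the text
    print(w, rows, predicted_unit_rows(w), rows * o2, w * (w - 1) * (w + 1))
```

In each printed line the second and third numbers should agree, as should the fourth and fifth.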

Start with r and extend this to an n dimensional basis. Scale one of the rows other than r, so that the determinant is 1. Then apply the Gram-Schmidt process to create an orthogonal basis with r still on top. The determinant is still 1. Dot each row with itself, and multiply the results together, and get the square of the determinant, which is 1. Since this product is a square, an even number of the dot products are nonsquares.

Scale each row so that its dot product is 1 or q, where q is your favorite nonsquare in F. If all dot products are 1 then S is orthonormal, and we're done. Otherwise consider a pair of rows with dot products equal to q. Call these two vectors u and v. Replace u with au+bv, and v with -bu+av, where a and b are arbitrary scalars in F. Dot either of these linear combinations with any other row and the result is still 0. Verify that (au+bv).(-bu+av) = 0. Now evaluate (au+bv).(au+bv) and get (a²+b²)q. Similarly, (-bu+av).(-bu+av) = (a²+b²)q. Since sums of squares are evenly distributed across the nonzero entries of F, there is an a and b such that a²+b² = 1/q. The two rows now have length 1. Do this for every pair of nonsquare rows, and S becomes orthonormal. That completes the proof.
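The repair of a nonsquare pair can be traced with concrete numbers. This is a minimal sketch, assuming F is the integers mod an odd prime p and finding a and b by plain search rather than by any clever method. The names find_a_b and normalize_pair are invented here, and the vectors u and v are just one example mod 7, where 3 is a nonsquare.

```python
def dot(x, y, p):
    """Dot product of two vectors, reduced mod p."""
    return sum(s * t for s, t in zip(x, y)) % p

def find_a_b(q, p):
    """Search for scalars a, b with (a^2 + b^2) * q = 1 mod p."""
    for a in range(p):
        for b in range(p):
            if (a * a + b * b) * q % p == 1:
                return a, b
    raise ValueError("no pair (a, b) found")

def normalize_pair(u, v, p):
    """Given orthogonal u, v with the same length q, return au+bv and -bu+av of length 1."""
    q = dot(u, u, p)
    a, b = find_a_b(q, p)
    new_u = tuple((a * s + b * t) % p for s, t in zip(u, v))
    new_v = tuple((-b * s + a * t) % p for s, t in zip(u, v))
    return new_u, new_v

p = 7
u, v = (1, 1, 1), (2, 2, 3)  # u.u = v.v = 3 and u.v = 0 mod 7
new_u, new_v = normalize_pair(u, v, p)
print(new_u, new_v)
print(dot(new_u, new_u, p), dot(new_v, new_v, p), dot(new_u, new_v, p))
```

The last line should show two lengths of 1 and a dot product of 0, so the new pair is orthonormal.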

With the size of O₃ established, you might think it is isomorphic to G₂. After all, both groups have the same order, namely w(w-1)(w+1). But G₂ has a center, namely ±1, while O₃ has only 1 as its center. So the groups are not isomorphic.

Higher Dimensions

The procedure for extending an arbitrary unit vector r out to a standard orthonormal matrix S works in higher dimensions, and leads to a recursive formula for the size of O, based on a recursive formula for the number of unit vectors r. Let z_n be the number of n dimensional vectors with length 0, and let y_n be the number of n dimensional vectors with length 1. We know that y₂ would come out the same for any nonzero length. We'll see that this holds for even indexes, but not for odd indexes. When n is odd, let q_n count the number of vectors whose length is a given nonsquare.

Just to review:

z₁ = 1, y₁ = 2, o₁ = 1.

z₂ = 2w-1 | 1, y₂ = w-1 | w+1, o₂ = w-1 | w+1.

z₃ = w², y₃ = w²+w | w²-w, q₃ = w²-w | w²+w, o₃ = w³-w

When n is even, build z_n in two parts. Let the first n-2 entries have squares that sum to 0, and then bring in the last two entries, whose squares must also sum to 0. This gives z_{n-2}·z₂. Then the first n-2 entries could produce something nonzero, and the last two must take up the slack. There are w-1 nonzero values, each reached in y_{n-2} ways and cancelled in y₂ ways. Similar reasoning holds for y_n.

z_n = z_{n-2}·z₂ + (w-1)·y_{n-2}·y₂

y_n = z_{n-2}·y₂ + y_{n-2}·z₂ + (w-2)·y_{n-2}·y₂

Even though z₂ and y₂ depend on the value of w mod 4, z₄ and y₄ do not.

z₄ = w³+w²-w, y₄ = w³-w, o₄ = w²(w-1)²(w+1)².
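The step from dimension 2 to dimension 4 can be run numerically to see the dependence on w mod 4 disappear. This is a sketch that simply evaluates the two even-dimension recursions for a few odd values of w; the function names are illustrative, and o₄ is computed as y₄ times o₃, following the top row argument.

```python
def base_counts(w):
    """Return (z2, y2) as tabulated above, split on w mod 4."""
    return (2 * w - 1, w - 1) if w % 4 == 1 else (1, w + 1)

def even_step(z_prev, y_prev, z2, y2, w):
    """Go from dimension n-2 to dimension n, n even, by splitting off two entries."""
    z = z_prev * z2 + (w - 1) * y_prev * y2
    y = z_prev * y2 + y_prev * z2 + (w - 2) * y_prev * y2
    return z, y

for w in (3, 5, 7, 9, 11, 13):
    z2, y2 = base_counts(w)
    z4, y4 = even_step(z2, y2, z2, y2, w)
    o3 = w ** 3 - w
    o4 = y4 * o3  # unit top rows times the size of O one dimension down
    print(w, z4 == w**3 + w**2 - w, y4 == w**3 - w,
          o4 == w**2 * (w - 1)**2 * (w + 1)**2)
```

Each line should print True three times, whether w is 1 or 3 mod 4.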

Verify that our formula for y₄ comes out the same for a nonsquare length. Like y₂, y₄ is the same across the nonzero lengths. In fact, as you step through n = 6, 8, 10, and so on, q_n always equals y_n. The distribution of vectors is even across the nonzero lengths.
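One way to carry out this verification, at least for small cases, is to tally all vectors of dimension 4 by length. The sketch assumes F is the integers mod a small odd prime p; length_histogram is a name made up for this example.

```python
from collections import Counter
from itertools import product

def length_histogram(n, p):
    """Map each c in Z/p to the number of vectors in (Z/p)^n whose squares sum to c."""
    return Counter(sum(x * x for x in v) % p
                   for v in product(range(p), repeat=n))

for p in (3, 5, 7):
    hist = length_histogram(4, p)
    nonzero_counts = {hist[c] for c in range(1, p)}
    print(p, hist[0], nonzero_counts, p**3 + p**2 - p, p**3 - p)
```

For each p the set of nonzero counts should hold a single value, equal to p³-p, which covers squares and nonsquares alike.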

Dimension 5

When n is odd we need some different formulas. Instead of splitting off two entries, split off the first entry, which is either 0 or nonzero, and derive the following.

z_n = z_{n-1} + (w-1)·y_{n-1}

y_n = (w-2)·y_{n-1} + 2·z_{n-1}

q_n = w·y_{n-1}

z₅ = w⁴, y₅ = w⁴+w², q₅ = w⁴-w², o₅ = w⁴(w-1)²(w+1)²(w²+1).
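The values for dimension 5 follow by feeding z₄ and y₄ into the three odd-dimension formulas. The sketch below does this for several odd w and compares against the closed forms; odd_step is an illustrative name, and o₅ is taken to be y₅ times o₄.

```python
def odd_step(z_prev, y_prev, w):
    """Go from even dimension n-1 to odd dimension n by splitting off one entry."""
    z = z_prev + (w - 1) * y_prev        # first entry 0, or one of w-1 nonzero values
    y = (w - 2) * y_prev + 2 * z_prev    # first entry is not ±1, or is ±1
    q = w * y_prev                       # a nonsquare length is never used up by one square
    return z, y, q

for w in (3, 5, 7, 9, 11):
    z4, y4 = w**3 + w**2 - w, w**3 - w
    z5, y5, q5 = odd_step(z4, y4, w)
    o4 = w**2 * (w - 1)**2 * (w + 1)**2
    o5 = y5 * o4
    # Zero, square, and nonsquare lengths together account for all w^5 vectors.
    total = z5 + (w - 1) // 2 * (y5 + q5)
    print(w, z5 == w**4, y5 == w**4 + w**2, q5 == w**4 - w**2,
          o5 == w**4 * (w - 1)**2 * (w + 1)**2 * (w**2 + 1), total == w**5)
```

The final check uses the fact that half of the nonzero entries of F are squares, each with y₅ vectors, and half are nonsquares, each with q₅ vectors.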

6 and Beyond

If you move on to dimension 6, use the formulas based on z₄, y₄, z₂, and y₂.

For dimension 7, separate the first entry and use the formulas that were applied to dimension 5.

Continue this process as far as you like. In dimension 6, as in dimension 2, the formula for o depends on w mod 4.

z₆ = w⁵+w³-w² | w⁵-w³+w², y₆ = w⁵-w² | w⁵+w², o₆ = y₆·o₅

z₇ = w⁶, y₇ = w⁶+w³ | w⁶-w³, q₇ = w⁶-w³ | w⁶+w³, o₇ = y₇·o₆

z₈ = w⁷+w⁴-w³, y₈ = w⁷-w³, o₈ = y₈·o₇
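The alternation between the two kinds of steps is easy to mechanize. Here is a sketch that builds the counts from dimension 2 up to any dimension, using the even formulas with z₂ and y₂ and the odd formulas with one entry split off. The function name and the table layout are choices made for this example, and q_n is set equal to y_n in even dimensions, as reported above.

```python
def counts_up_to(n_max, w):
    """Return {n: (z_n, y_n, q_n)} for n = 2..n_max by alternating the two recursions."""
    z2, y2 = (2 * w - 1, w - 1) if w % 4 == 1 else (1, w + 1)
    table = {2: (z2, y2, y2)}
    for n in range(3, n_max + 1):
        if n % 2 == 1:
            # Odd dimension: split off one entry from dimension n-1.
            z_prev, y_prev, _ = table[n - 1]
            z = z_prev + (w - 1) * y_prev
            y = (w - 2) * y_prev + 2 * z_prev
            q = w * y_prev
        else:
            # Even dimension: split off two entries from dimension n-2.
            z_prev, y_prev, _ = table[n - 2]
            z = z_prev * z2 + (w - 1) * y_prev * y2
            y = z_prev * y2 + y_prev * z2 + (w - 2) * y_prev * y2
            q = y  # even distribution across nonzero lengths
        table[n] = (z, y, q)
    return table

for w in (3, 5, 7):
    for n, (z, y, q) in counts_up_to(8, w).items():
        # Sanity check: all lengths together must account for w^n vectors.
        total = z + (w - 1) // 2 * (y + q)
        print(w, n, z, y, q, total == w**n)
```

The last column should read True on every line, and the rows for n = 6, 7, and 8 can be compared with the closed forms listed above.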