Hook length formula

In combinatorial mathematics, the hook-length formula is a formula for the number of standard Young tableaux whose shape is a given Young diagram. It has applications in diverse areas such as representation theory, probability, and algorithm analysis; for example, the problem of longest increasing subsequences.

Definitions and statement

Let \lambda=(\lambda_1,\ldots,\lambda_m) be a partition of n. It is customary to interpret \lambda graphically as a Young diagram, namely a left-justified array of square cells with m rows and \lambda_i cells in the ith row for each 1\le i\le m. A standard Young tableau of shape \lambda is a Young diagram of shape \lambda in which each of the n cells contains a distinct integer between 1 and n (i.e., no repetition), such that each row and each column form increasing sequences. For each cell of the Young diagram in coordinates (i,j) (that is, the cell in the ith row and jth column), the hook H_\lambda(i,j) is the set of cells (a,b) such that a=i and b \ge j or a \ge i and b=j. The hook-length h_\lambda(i,j) is the number of cells in the hook H_\lambda(i,j).

Then the hook-length formula expresses the number of standard Young tableaux of shape \lambda, sometimes denoted by d_\lambda, as

 d_\lambda = \frac{n!}{\prod h_\lambda (i,j)} ,

where the product is over all cells (i,j) of \lambda.

Example

A tableau listing the hook length of each cell in the Young diagram
(4, 3, 1, 1)

The figure on the right shows hook-lengths for all cells in the Young diagram λ of the partition
9 = 4 + 3 + 1 + 1. Then the number of Young tableaux d_\lambda for this Young diagram can be computed as

d_\lambda = \frac{9!}{7\cdot 5\cdot 4\cdot 3\cdot 2\cdot 2\cdot 1\cdot 1\cdot 1} = 216.

History

There are other formulas for d_\lambda, but the hook-length formula is particularly simple and elegant. The hook-length formula was discovered in 1954 by J. S. Frame, G. de B. Robinson, and R. M. Thrall by improving a less convenient formula expressing d_\lambda in terms of a determinant. [1] This earlier formula was deduced independently by G. Frobenius and A. Young in 1900 and 1902 respectively using algebraic methods. [2] [3] P. A. MacMahon found an alternate proof for the Young–Frobenius formula in 1916 using difference methods. [4]

Despite the simplicity of the hook-length formula, the Frame–Robinson–Thrall proof is uninsightful and does not provide an intuitive argument as to why hooks appear in the formula. The search for a short, intuitive explanation befitting such a simple result gave rise to many alternate proofs for the hook-length formula. [5] A. P. Hillman and R. M. Grassl gave the first proof that illuminates the role of hooks in 1976 by proving a special case of the Stanley hook-content formula, which is known to imply the hook-length formula. [6] C. Greene, A. Nijenhuis, and H. S. Wilf found a probabilistic proof using the hook walk in which the hook lengths appear naturally in 1979. [7] J. B. Remmel adapted the original Frame–Robinson–Thrall proof into the first bijective proof for the hook-length formula in 1982. [8] A direct bijective proof was first discovered by D. S. Franzblau and D. Zeilberger in 1982. [9] D. Zeilberger also converted the Greene–Nijenhuis–Wilf hook walk proof into a bijective proof in 1984. [10] A simpler direct bijective proof was announced by Igor Pak and Alexander V. Stoyanovskii in 1992, and its complete proof was presented by the pair and Jean-Christophe Novelli in 1997. [11] [12]

Meanwhile, the hook-length formula has been generalized in several ways. R. M. Thrall found the analogue to the hook-length formula for shifted Young Tableaux in 1952. [13] B. E. Sagan gave a shifted hook walk proof for the hook-length formula for shifted Young tableaux in 1980. [14] B. E. Sagan and Y. N. Yeh proved the hook-length formula for binary trees using the hook walk in 1989. [15]

Proofs

Knuth's heuristic argument

The hook-length formula can be understood intuitively using the following heuristic, but incorrect, argument suggested by D. E. Knuth.[16] Given that each element of a tableau is the smallest in its hook and filling the tableau shape at random, the probability that cell (i,j) will contain the minimum element of the corresponding hook is the reciprocal of the hook length. Multiplying these probabilities over all i and j gives the formula. This argument is fallacious since the events are not independent.

Knuth's argument is however correct for the enumeration of labellings on trees satisfying monotonicity properties analogous to those of a Young tableau. In this case, the 'hook' events in question are in fact independent events.

Probabilistic proof using the hook walk

This is a probabilistic proof found by C. Greene, A. Nijenhuis, and H. S. Wilf in 1979.[17] Here is a sketch of the proof. Define

e_\lambda = \frac{n!}{\prod_{(i,j)\in Y(\lambda)}h_\lambda(i,j)}.

we would like to show that d_\lambda = e_\lambda. The first observation about d_\lambda is

d_\lambda = \sum_{\mu\uparrow\lambda}d_\mu,
Corners of the Young diagram (5,3,2,1,1)

where \mu\uparrow\lambda denotes that \mu are Young tableau obtained from  \lambda by deleting one corner cell from \lambda. The sum is taken over all such \mu. Here we are taking the convention that d_{\phi}=1, where \phi denotes the empty diagram. The explanation for the above equation is that the maximal entry of the Young tableau of shape \lambda occurs at one of its corner cells. By deleting that cell we will obtain a Young tableau of shape \mu. Since the number of Young tableau of shape \mu is d_{\mu}, taking the sum over all such \mu we get the equation.

Notice that we also have e_{\phi}=1. Therefore it is enough to show that

e_\lambda = \sum_{\mu\uparrow\lambda}e_\mu,

and the result d_\lambda=e_\lambda then follows by induction. The above sum can also be viewed as a sum of probabilities by rewriting the equation to be shown as

 \sum_{\mu\uparrow\lambda}\frac{e_\mu}{e_\lambda}=1.

We therefore need to show that the numbers \frac{e_\mu}{e_\lambda} define a probability measure on the set of Young diagrams \mu (where \mu\uparrow\lambda). This is done in a constructive way by defining a random walk, called the hook walk, on the cells of the Young diagram \lambda, which eventually selects one of the corner cells of \lambda (which are in bijection with diagrams \mu for which \mu\uparrow\lambda). The hook walk is defined by the following rules.

(1) Pick a cell uniformly at random from |\lambda| cells. Start the random walk from there.

(2) Successor of current cell (i,j) is chosen uniformly at random from the hook H_{\lambda}(i,j)\setminus \{(i,j)\}.

(3) Continue until you reach at one of the corner cells, call it \textbf{c}.

Proposition: For any corner cell (a,b) of \lambda we have

\mathbb{P}\left(\textbf{c}=(a,b)\right)=\frac{e_\mu}{e_\lambda},

where \mu=\lambda\setminus\{(a,b)\}.

Once we have the above proposition, taking the sum over all possible corner cells \textbf{c}=(a,b) we have  \sum_{\mu\uparrow\lambda}\frac{e_{\mu}}{e_{\lambda}}=1, as claimed.


Connection to representation theory

The hook-length formula is of great importance in the representation theory of the symmetric group S_n, where the number d_\lambda is known to be equal to the dimension of the irreducible representation V_\lambda associated to \lambda, and is frequently denoted by \dim V_\lambda, \dim \lambda or f^\lambda.

Detailed discussion

The irreducible representations  V_{\lambda} of the symmetric group are indexed by partitions  \lambda of  n (for an explicit construction see Specht module) . Their characters are related to the theory of symmetric functions via the Hall inner product in the following magical formula

 \chi^\lambda(w)=\langle s_\lambda, p_{\tau(w)}\rangle

where  s_{\lambda} is the schur function associated to  \lambda and  p_{\tau(w)} is the power symmetric function of the partition  \tau(w) associated to the cycle decomposition of  w . For example, if  w=(154)(238)(6)(79) then  \tau(w) = (3,3,2,1) .

Since  e=(1)(2)\cdots(n) in cycle notation,  \tau(e) = 1+1+\cdots+1=1^{(n)} . Then the formula says

 \dim  V_\lambda =\chi^\lambda(e) = \langle s_\lambda, p_{1^{(n)}}\rangle

Considering the expansion of schur functions in terms on monomial symmetric functions using the Kostka numbers

  s_\lambda= \sum_\mu K_{\lambda\mu}m_\mu.\

the inner product with  p_{1^{(n)}} = h_{1^{(n)}} is   K_{\lambda 1^{(n)}} , because   \langle m_{\mu},h_{\nu} \rangle = \delta_{\mu\nu} . Note that  K_{\lambda 1^{(n)}}  is equal to  d_{\lambda} . Hence

 \dim V_\lambda = d_\lambda.

An immediate consequence of this is

 \sum_{\lambda\vdash n} \left(f^{\lambda}\right)^2 = n!

The above equality also a simple consequence of the Robinson–Schensted–Knuth correspondence.

The computation also shows that:

 (x_1+x_2+\cdots+x_k)^n = \sum_{\lambda\vdash n} s_\lambda f^\lambda.

Which is the expansion of   p_{1^{(n)}} in terms of schur functions using the coefficients given by the inner product, because  \langle s_{\mu},s_{\nu}\rangle = \delta_{\mu\nu}  . The above equality can be proven also checking the coefficients of each monomial at both sides and using the Robinson–Schensted–Knuth correspondence or, more conceptually, looking at the decomposition of V^{\otimes n} by irreducible   GL(V) modules, and taking characters. See Schur–Weyl duality.

Outline of the proof of the hook formula using Frobenius formula

By the above considerations

 p_{1^{(n)}} =  \sum_{\lambda\vdash n} s_{\lambda}f^{\lambda}

So that

 \Delta(x)p_{1^{(n)}} = \sum_{\lambda\vdash n} \Delta(x)s_{\lambda}f^{\lambda}

where   \Delta(x) = \prod_{i<j} (x_i-x_j) is the Vandermonde determinant

For a given partition  \lambda = (\lambda_1, \lambda_2, \cdots, \lambda_k) define  l_i = \lambda_i+k-1 for  i=1,2,\cdots,k . For the following we need at least as many variables as rows in the partition, so for now on we work with n variables x_1,\cdots,x_n.

Each term  \Delta(x)s_{\lambda}  is equal to sums of

 a_{(\lambda_1+n-1, \lambda_2+n-2, \dots , \lambda_k)} (x_1, x_2, \dots , x_k) =
\det \left[ \begin{matrix} x_1^{l_1} & x_2^{l_1} & \dots & x_k^{l_1} \\
x_1^{l_2} & x_2^{l_2} & \dots & x_k^{l_2} \\
\vdots & \vdots & \ddots & \vdots \\
x_1^{l_k} & x_2^{l_k} & \dots & x_k^{l_k} \end{matrix} \right]

See Schur function. Since the vector  (l_1,l_2,\cdots, l_k) is different for each partition, this means that the coefficient of   x_1^{l_1}\cdots x_k^{l_k} in \Delta(x)p_{1^{(n)}}, denoted \left[\Delta(x)p_{1^{(n)}}\right]_{l_1,\cdots,l_k}, is equal to  f^{\lambda} . This is known as the Frobenius Character Formula, which gives one of the earliest proofs [18] All that remains is tracking that coefficient with a mixture of cleverness and brute force:

 \Delta(x) = \sum_{w\in S_n}  \sgn(w) x_1^{w(1)-1}x_2^{w(2)-1}\cdots x_k^{w(k)-1}
 (x_1+x_2+\cdots+x_k)^n = \sum \frac{n!}{d_1!d_2!\cdots d_k!} x_1^{d_1}x_2^{d_2}\cdots x_k^{d_k}

By pairing, the coefficient that we are looking for is

 \sum_{w\in S_n} \sgn(w)\frac{n!}{(l_1-w(1)+1)!(l_2-w(2)+1)!\cdots (l_k-w(k)+1)!}

which can be written as

 \frac{n!}{l_1!l_2!\cdots l_k!} \sum_{w\in S_n} \sgn(w)\left[(l_1)(l_1-1)\cdots(l_1-w(1)+2)\right] \left[(l_2)(l_2-1)\cdots(l_2-w(2)+2)\right]\left[(l_k)(l_k-1)\cdots(l_k-w(k)+2)\right]

the sum is equal to the following determinant

 \det \left[ \begin{matrix} 1 & l_1 & l_1(l_1-1) & \dots & \prod_{i=0}^{k-2} l_1-i \\
1 & l_2 & l_1(l_2-1) & \dots & \prod_{i=0}^{k-2} l_2-i\\
\vdots & \vdots & \vdots& \ddots & \vdots \\
1 & l_k & l_k(l_k-1) & \dots & \prod_{i=0}^{k-2} l_k-i \end{matrix} \right]

which column reduces to Vandermonde, and we obtain the formula

 \frac{n!}{l_1!l_2!\cdots l_k!} \prod_{i<j} (l_i-l_j)

Note that  l_i is the hook length of the first box in each row of the Young Diagram. From here its easy to clean the mess of the cancellations and get the hook formula.

Connection to longest increasing subsequences

The hook length formula also has important applications to the analysis of longest increasing subsequences in random permutations. If \sigma_n denotes a uniformly random permutation of order n, L(\sigma_n) denotes the maximal length of an increasing subsequence of \sigma_n, and \ell_n denotes the expected (average) value of L(\sigma_n), Anatoly Vershik and Sergei Kerov [19] and independently Benjamin F. Logan and Lawrence A. Shepp [20] showed that when n is large, \ell_n is approximately equal to 2 \sqrt{n}. This answers a question originally posed by Stanislaw Ulam. The proof is based on translating the question via the Robinson–Schensted correspondence to a problem about the limiting shape of a random Young tableau chosen according to Plancherel measure. Since the definition of Plancherel measure involves the quantity d_\lambda, the hook length formula can then be used to perform an asymptotic analysis of the limit shape and thereby also answer the original question.

The ideas of Vershik–Kerov and Logan–Shepp were later refined by Jinho Baik, Percy Deift and Kurt Johansson, who were able to achieve a much more precise analysis of the limiting behavior of the maximal increasing subsequence length, proving an important result now known as the Baik–Deift–Johansson theorem. Their analysis again makes crucial use of the fact that d_\lambda has a number of good formulas, although instead of the hook length formula it made use of one of the determinantal expressions.

Related formulas

The formula for the number of Young tableau of a given shape was originally derived from the Frobenius determinant formula in connection to representation theory.[21] If the shape of a Young diagram is given by the row lengths n_1,\dots,n_m, then the number of tableau with that shape is given by

 f(n_1,n_2,\ldots,n_m) = \frac{n! \,\Delta(n_m,n_{m-1}+1,\ldots,n_1 + m - 1)}{n_m ! (n_{m-1} + 1)! \cdots (n_1 + m - 1)!}

Hook lengths can also be used to give a product representation to the generating function for the number of reverse plane partitions of a given shape.[22] If λ is a partition of some integer p, a reverse plane partition of n with shape λ is obtained by filling in the boxes in the Young diagram with non-negative integers such that the entries add to n and are non-decreasing along each row and down each column. The hook lengths  h_1,\dots,h_p can be defined as with Young tableau. If πn denotes the number of reverse plane partitions of n with shape λ, then the generating function can be written as

 \sum_{n=0}^\infty \pi_n x^n = \prod_{k=1}^p (1-x^{h_k})^{-1}

Stanley discovered another formula for the same generating function.[23] In general, if  A is any poset with  n elements, the generating function for reverse  A -partitions is

 \frac{P(x)}{(1-x)(1-x^2)\cdots(1-x^n)}

where  P(x)  is a polynomial such that P(1) is the number of natural labelings of  A .

In the case of a partition  \lambda , we are considering the poset in its cells given by the relation

 (i,j) \leq (i',j') \iff i\leq i' \qquad \textrm{and} \qquad j\leq j'.

So a natural labeling is simply a standard Young tableau, i.e.  P(1) = f^{\lambda}

Yet another proof of the hook length formula

Combining the two formulas for the generating functions we have

 \frac{P(x)}{(1-x)(1-x^2)\cdots(1-x)^n}  = \prod_{(i,j)\in \lambda} (1-x^{h_{(i,j)}})^{-1}

Both sides converge inside the disk of radius one and the following expression makes sense for  |x| < 1

 P(x) = \frac{\prod_{k=1}^n (1-x^k)}{\prod_{(i,j)\in \lambda} (1-x^{h_{(i,j)}})}.

It would be violent to plug in 1, but the right hand side is a continuous function inside the unit disk and a polynomial is continuous everywhere so at least we can say

 P(1) = \lim_{x\to 1} \frac{\prod_{k=1}^n (1-x^k)}{\prod_{(i,j)\in \lambda} (1-x^{h_{(i,j)}})}.

Finally, applying L'Hopital's rule  n times yields the hook length formula

 P(1) = \frac{n!}{\prod_{(i,j)\in \lambda}h_{(i,j)}}.

Specialization of schur functions

Specializing the schur functions to the variables  1,t,t^2,t^3, \cdots there is the formula

 s_{\lambda}(1,t,t^2,\cdots) = \frac{t^{n\left( \lambda\right)}}{\prod_{(i,j)\in Y(\lambda)}(1-t^{h_{\lambda}(i,j)})}

The number  n(\lambda) is defined as

 n(\lambda) = \sum_i (i-1)\lambda_i = \sum_i \binom{\lambda_i'}{2}

where  \lambda' is the conjugate partition

A formula for the Catalan numbers

The Catalan numbers are ubiquitous in enumerative combinatorics. Not surprisingly, they are also part of this story:

 C_n = f^{(n,n)}

Lets briefly mention why. When doing a Dyck path we may go up (U) or down (D). So for any Dyck path of length  n consider the tableaux of shape  (n,n) such that the first row is given by the numbers  i such that the  i -th step was up and in the second row given by the positions in which it goes down. For example UUDDUD correspond to the tableaux with rows 125 and 346.

The hook formula gives another way of getting a closed formula for the Catalan numbers

 C_n = \frac{(2n)!}{(n+1)(n)\cdots(3)(2)(n)(n-1)\cdots(2)(1)} = \frac{(2n)!}{(n+1)!n!} = \frac{1}{n+1}\binom{2n}{n}

(\TO DO: in the Catalan article they mention this relation and the have a link for hook length formula, which redirects to Young tableau, which is well written but offers no intuition about the math. Those articles should be redirected to this one)

See also

References

  1. Frame, J. S., Robinson, G. de B. and Thrall, R. M. (1954). The hook graphs of the symmetric group. Canad. J. Math. 6, 316–325.
  2. G. Frobenius. Uber die charaktere der symmetrischer gruppe, Preuss. &ad. Wk. sitz. (1900), 516–534.
  3. A. Young. Quantitative substitutional analysis II, Proc. London Math. Sot., Ser. 1, 35 (1902), 361–397.
  4. P. A. MacMahon. “Combinatory Analysis,” Cambridge Univ. Press, London/New York, 1916; reprinted by Chelsea, New York, 1960.
  5. Knuth, Donald (1973). The Art of Computer Programming, Volume 3: Sorting and Searching, 3rd Edition, Addison–Wesley, p. 63
  6. A. P. Hillman and R. M. Grassl. Reverse plane partitions and tableau hook numbers, J. Comb. Theory, Ser. A 21 (1976), 216–221.
  7. Greene, C., Nijenhuis, A. and Wilf, H. S. (1979). A probabilistic proof of a formula for the number of Young tableaux of a given shape. Adv. in Math. 31, 104–109.
  8. J. B. Remmel. Bijective proofs of formulae for the number of standard Young tableaux, Linear and Multilinear Algebra 11 (1982), 45–100.
  9. Franzblau, D. S. and Zeilberger, D. (1982). A bijective proof of the hook-length formula. J. Algorithms 3, 317–343.
  10. D. Zeilberger. A short hook-lengths bijection inspired by the Greene–Nijenhuis–Wilf proof, Discrete Math. 51 (1984), 101–108.
  11. Pak, I. M. and Stoyanovskii, A. V. (1992). A bijective proof of the hook-length formula. Funct. Anal. Appl. 24.
  12. Novelli, J.-C., Pak, I. M. and Stoyanovskii, A. V. (1997). A direct bijective proof of the hook-length formula. Discrete Mathematics and Theoretical Computer Science 1, 1997, 53–67.
  13. R. M. Thrall. A combinatorial problem, Michigan Math. J. 1 (1952), 81–88.
  14. Sagan, B. On selecting a random shifted Young tableau. J. Algorithms 1, 3 (1980), 213–234.
  15. Sagan, B. E., and Yeh, Y. N. Probabilistic algorithms for trees. Fibonacci Quart. 27, 3 (1989), 201–208.
  16. Knuth, Donald (1973), The Art of Computer Programming, Volume 3: Sorting and Searching, 3rd Edition, Addison–Wesley, p. 63, ISBN 0-201-03803-X.
  17. Greene, C., Nijenhuis, A. and Wilf, H. S. (1979). A probabilistic proof of a formula for the number of Young tableaux of a given shape. Adv. in Math. 31, 104–109.
  18. W. Fulton, J. Harris. Representation Theory: A First Course Springer-Verlag , New York, 1991
  19. Vershik, A. M.; Kerov, C. V. (1977), "Asymptotics of the Plancheral measure of the symmetric group and a limiting form for Young tableaux", Dokl. Akad. Nauk USSR 233: 1024–1027
  20. B. F. Logan and L. A. Shepp, A variational problem for random Young tableaux, Advances in Math. 26 (1977), no. 2, 206–222.
  21. Knuth, Donald (1973), The Art of Computer Programming 3 (1 ed.), Addison–Wesley, pp. 61–62
  22. Stanley, Richard P. (1971), "Theory and applications of plane partitions, 2", Studies in Applied Mathematics 50: 259–279
  23. R.P. Stanley, "Ordered Structures and Partitions" PhD Thesis, Harvard University, 1971

External links