next up previous
Next: Computing outer probabilities Up: Rule probability estimation Previous: Rule probability estimation

Computing expected production counts

Before going into the details of computing outer probabilities we describe their use in obtaining the expected rule counts needed for the E-step in grammar estimation.

Let tex2html_wrap_inline8763 denote the expected number of uses of production tex2html_wrap_inline7913 in the derivation of string x. Alternatively, tex2html_wrap_inline8763 is the expected number of times that tex2html_wrap_inline7913 is used for prediction in a complete Earley path generating x. Let tex2html_wrap_inline8775 be the number of occurrences of predicted states based on production tex2html_wrap_inline7913 along a path tex2html_wrap_inline8779 .

eqnarray6345

The last summation is over all predicted states based on production tex2html_wrap_inline7913 . The quantity tex2html_wrap_inline8791 is the sum of the probabilities of all paths passing through tex2html_wrap_inline8793 . Inner and outer probabilities have been defined such that this quantity is obtained precisely as the product of the corresponding of tex2html_wrap_inline8795 and tex2html_wrap_inline8797 . Thus, the expected usage count for a rule can be computed as

displaymath8761

The sum can be computed after completing both forward and backward passes (or during the backward pass itself) by scanning the chart for predicted states.



Andreas Stolcke
Sat Jun 29 21:49:02 PDT 1996