N-gram statistics can be used to guess the part-of-speech of words in text. If the part-of-speech of each word can be tagged correctly, parsing ambiguity is greatly reduced.
'Twas brillig, and the slithy toves
Did gyre and gimble in the wabe. [from Jabberwocky, by Lewis Carroll]
A Hidden Markov Model (HMM) tagger chooses, for each word, the tag that maximizes [Jurafsky, op. cit.]: P(word | tag) * P(tag | previous n tags)
For a bigram tagger, this is approximated as:
t_i = argmax_j P( w_i | t_j ) P( t_j | t_{i-1} )
In practice, trigram taggers are most often used, and a search (e.g. the Viterbi algorithm) is made for the best tag sequence for the whole sentence; accuracy is about 96%.
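As a minimal sketch of the bigram formula above, the toy tagger below chooses each tag greedily, left to right. The tag set and all probabilities are made-up illustrative values, not trained counts, and a real HMM tagger would search over whole tag sequences rather than committing to one tag at a time:

```python
# Toy bigram tagger sketch: greedy left-to-right choice of
#   t_i = argmax_j P(w_i | t_j) * P(t_j | t_{i-1})
# All probabilities are illustrative, not estimated from a corpus.

emit = {   # P(word | tag)
    ("the", "DET"): 0.5,
    ("dog", "NOUN"): 0.1, ("dog", "VERB"): 0.01,
    ("barks", "VERB"): 0.1, ("barks", "NOUN"): 0.005,
}

trans = {  # P(tag | previous tag); "<s>" marks sentence start
    ("<s>", "DET"): 0.6, ("<s>", "NOUN"): 0.3, ("<s>", "VERB"): 0.1,
    ("DET", "NOUN"): 0.7, ("DET", "VERB"): 0.05,
    ("NOUN", "VERB"): 0.5, ("NOUN", "NOUN"): 0.2,
    ("VERB", "DET"): 0.3,
}

TAGS = ["DET", "NOUN", "VERB"]

def tag(words):
    """Greedily pick the best tag for each word given the previous tag.
    (A simplification: a full HMM tagger uses Viterbi search to find
    the best tag sequence for the whole sentence.)"""
    prev, result = "<s>", []
    for w in words:
        best = max(TAGS,
                   key=lambda t: emit.get((w, t), 0.0) * trans.get((prev, t), 0.0))
        result.append(best)
        prev = best
    return result

print(tag(["the", "dog", "barks"]))  # → ['DET', 'NOUN', 'VERB']
```

Note how the tagger resolves "dog" to NOUN rather than VERB: although both emission probabilities are nonzero, the transition P(NOUN | DET) strongly favors the noun reading, which is exactly how tag context reduces ambiguity.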