资源说明:A Part of Speech Tagger using a Hidden Markov Model
For the Sequence-Tagging final project, we implemented a Hidden Markov Model part-of-speech tagger. We implemented the algorithms from scratch, and have included the code for our system. For the description of the system, see the report. To run baseline system: java Baseline java Scorer (note: data must be placed in the data folder and be called train.pos/test.pos) open scoring/score.html To run HMM system: java HMM java Scorer (note: data must be placed in the data folder and be called train.pos/test.pos) open scoring/score.html The file scoring/score.html contains the percent correct as well as the number of times a POS tag on the top was chosen for a POS tag on the left (the correct tag in the test data). That is, the left tags were guessed as the top tags. To see the words that contributed to the number in a particular box, click the box to toggle the word list. Each score-*.html file is the score file for that particular configuration of the HMM system. Refer to the report for a description of each.
本源码包内暂不包含可直接显示的源代码文件,请下载源码包。
English
