BELZ, ANJA (2002) Grammar learning by partition search In: Proceedings of the LREC 2002 workshop on event modelling for multilingual document linking, Las Palmas, Spain.Full text not available from this repository.
This paper describes Grammar Learning by Partition Search, a general method for automatically constructing grammars for a range of parsing tasks. Given a base grammar, a training corpus, and a parsing task, Partition Search constructs an optimised probabilistic context-free grammar by searching a space of nonterminal set partitions, looking for a partition that maximises parsing performance and minimises grammar size. The method can be used to optimise grammars in terms of size and performance, or to adapt existing grammars to new parsing tasks and new domains. This paper reports an example application to optimising a base grammar extracted from the Wall Street Journal Corpus. Partition Search improves parsing performance by up to 5.29%, and reduces grammar size by up to 16.89%. Parsing results are better than in existing treebank grammar research, and compared to other grammar compression methods, Partition Search has the advantage of achieving compression without loss of grammar coverage.
|Item Type:||Contribution to conference proceedings in the public domain ( Full Paper)|
|Uncontrolled Keywords:||Natural language generation; Partition searching;|
|Subjects:||Q000 Languages and Literature - Linguistics and related subjects > Q100 Linguistics|
|Faculties:||Faculty of Science and Engineering > School of Computing, Engineering and Mathematics > Natural Language Technology|
|Date Deposited:||18 Nov 2007|
|Last Modified:||07 Feb 2013 03:05|
Actions (login required)
Downloads per month over past year