Assessing the trade-off between system building cost and output quality in data-to-text generation
Belz, Anja and Kow, Eric (2010) Assessing the trade-off between system building cost and output quality in data-to-text generation. In: Krahmer, E. and Theune, M., eds. Empirical Methods in Natural Language Generation. Lecture Notes in Computer Science, 5790. Springer-Verlag, Berlin, Heidelberg, pp. 180-200. ISBN 9783642155727
Full text not available from this repository.
Official URL: http://www.springerlink.com/content/w571871367854w...
Data-to-text generation systems tend to be knowledge-based and manually built, which limits their reusability and makes them time- and cost-intensive to create and maintain. Methods for automating (part of) the system building process exist, but do such methods risk a loss in output quality? In this paper, we investigate the cost/quality trade-off in generation system building. We compare six data-to-text systems which were created by predominantly automatic techniques against six systems for the same domain which were created by predominantly manual techniques. We evaluate the systems using intrinsic automatic metrics and human quality ratings. We find that there is some correlation between degree of automation in the system-building process and output quality (more automation tending to mean lower evaluation scores). We also find that there are discrepancies between the results of the automatic evaluation metrics and the human-assessed evaluation experiments. We discuss caveats in assessing system-building cost and implications of the discrepancies in automatic and human evaluation.