Improving Statistical Linguistic Algorithms for Parsing Mathematics
Cezary Kaliszyk, Josef Urban, Jiří Vyskočil
11th International Workshop on the Implementation of Logics, EPiC 40, pp. 27-36, 2016.
Abstract
In this paper we describe our combined statistical/semantic parsing method based on the CYK chart-parsing algorithm augmented with limited internal typechecking and external ATP filtering. This method was previously evaluated on parsing ambiguous mathematical expressions over the informalized Flyspeck corpus of 20000 theorems. We first discuss the motivation and drawbacks of the first version of the CYK-based component of the algorithm, and then we propose and implement a more sophisticated approach based on a better statistical model of mathematical data structures.
@inproceedings{ckjujv-iwil15,
author = {Cezary Kaliszyk and Josef Urban and Ji\v{r}\'{\i} Vysko\v{c}il},
title = {Improving Statistical Linguistic Algorithms for Parsing Mathematics},
booktitle = {The 11th International Workshop on the Implementation of Logics (IWIL'15)},
editor = {Boris Konev and Stephan Schulz and Laurent Simon},
series = {EasyChair Proceedings in Computing},
volume = {40},
pages = {27--36},
year = {2016},
publisher = {EasyChair},
}