Variable Typing Corpus

Variable Typing Corpus

Sentences in mathematical texts often contain variables and their associated types. Variable typing is the task of assigning the type to each free variable appearing in a sentence. We have annotated a gold dataset of 7,803 sentences composed of 33,524 assignment relations (arcs) between variables and mathematical types. The sentences in our corpus are sourced from the Mathematical REtrieval Corpus (MREC), a subset of arXiv (over 439,000 papers).

Download the Variable Typing Corpus

Please cite the following paper:

Variable typing: data set for assigning meaning to variables in mathematical text