What is the file lexicon_rbst.tdl for in the ERG?

(Yes, I did search the wiki :slight_smile: ).

The file lexicon-rbst.tdl contains lexical entries that are not part of the standard grammar of English, but which appear with some frequency even in edited text, including duplicate entries such as “the the” or the erroneous use of “a” with a vocalic onset word, as in “a apple”. Each of these entries is marked with the feature-value pair “GENRE robust” so analyses using them can be controlled via the appropriate root condition when parsing. Normally, they are included for parsing to improve coverage, but excluded for generation.

1 Like

Thank you, @Dan !

It would be good to add this at the top of the lexicon_rbst.tdl file, …