I’m creating a list of test cases for my phenomena, and hand-spun a TSDB++ item generator from my tables. I was having some issues until I realized that TSDB++ really wanted an integer value for the field “i-difficulty” so I just defaulted it to 1. I’m assuming this field means something like “How difficult is it to capture this sentence grammatically?” but I realized I don’t have a good understanding for all of the fields in the Relations file. I figured this would be a good place to ask. Below are the required item fields and my interpretation of them. My question is: What is the intended meaning of the fields I’ve left off or guessed at?
id (integer) - Unique ID
origin - speaker or location the data came from
register - sociolinguistic register??
format - orthography choice??
difficulty - difficulty to parse??
category - ??
input - non-segmented orthography
tokens - segmented orthography
gloss - glossed segments
translation - free translation
wf - well-formedness (0 ungrammatical, 1 grammatical)
length - number of words (why is this necessary?)
comment - free comment field
author - author of the test suite/test item
date - date it was added (also unclear why this is necessary)