the quality or size of the data set, for example measured through tokens;