INDEX
Explanations
sports-related terms, such as league names and events
punctuation and formatting markers in the text
New Auto-Interp
Negative Logits
nodd
-0.77
unnecess
-0.75
necks
-0.72
tack
-0.71
spons
-0.71
pport
-0.71
buyers
-0.68
downright
-0.68
folk
-0.68
slic
-0.68
POSITIVE LOGITS
During
1.60
Afterwards
1.59
Later
1.55
Shortly
1.50
Following
1.46
Throughout
1.43
Eventually
1.41
Additionally
1.40
Upon
1.40
After
1.35
Activations Density 0.167%