Explanations
references to a year enclosed in parentheses
instances of parentheses or related punctuation
Negative Logits
lull        -0.75
nuts        -0.70
ĸļ          -0.64
psychiat    -0.63
inund       -0.62
nesday      -0.62
obscurity   -0.62
everyday    -0.61
ylum        -0.61
electr      -0.60
Positive Logits
?)          1.06
formerly    1.06
including   1.01
...)        1.00
see         1.00
also        0.98
!)          0.98
excluding   0.97
via         0.96
emphasis    0.94
Activations Density: 0.123%