INDEX
Explanations
questions or statements followed by commas
punctuation and specific interrogative expressions
New Auto-Interp
Negative Logits
inval
-0.77
ixel
-0.67
://
-0.67
treasury
-0.65
gobl
-0.64
Dealer
-0.63
desper
-0.62
murderers
-0.59
corrid
-0.58
condem
-0.58
POSITIVE LOGITS
uh
0.71
then
0.69
say
0.67
eenth
0.67
Sund
0.65
um
0.62
Amen
0.61
usually
0.61
orians
0.60
pray
0.60
Activations Density 0.163%