INDEX
Explanations
statements or messages containing verbs in the past tense
phrases related to official statements or messages
Negative Logits
ño              -0.72
apolis          -0.70
Goodman         -0.66
pload           -0.66
engeance        -0.64
idable          -0.64
avement         -0.64
ilogy           -0.63
ascal           -0.63
icho            -0.63
Positive Logits
aloud           1.35
just            1.05
dress           0.98
mitt            0.90
ahead           0.83
mill            0.81
ILY             0.80
printed         0.79
comprehension   0.77
ied             0.76
Activations Density: 0.027%