INDEX
Explanations
pronouns followed by a verb indicating action or opinion
repeated references to the collective pronoun "They."
New Auto-Interp
Negative Logits
CCC
-0.78
Eleven
-0.77
duc
-0.64
INGTON
-0.64
Govern
-0.63
Amen
-0.62
DAY
-0.62
Sirius
-0.61
Innocent
-0.61
âĺħâĺħ
-0.60
POSITIVE LOGITS
're
1.24
zbollah
1.12
'll
1.10
selves
1.07
've
1.05
pherd
0.99
gemony
0.96
resy
0.91
'd
0.90
miah
0.86
Activations Density 0.164%