INDEX
Explanations
words related to people's names or roles
the preposition "in"
Negative Logits
Token       Logit
Seym        -0.90
redund      -0.75
destro      -0.73
warr        -0.72
millenn     -0.69
catentry    -0.69
hovah       -0.69
¶æ          -0.68
payday      -0.68
reluct      -0.67
Positive Logits
Token       Logit
strument    1.44
aug         1.24
jury        1.18
iti         1.16
ners        1.13
flation     1.12
jured       1.11
hibited     1.10
cluded      1.08
ned         1.07
Activations Density: 0.051%