INDEX
Explanations
occurrences of specific names and titles
Negative Logits
Token        Logit
ardon        -0.17
ано          -0.15
èĨľ          -0.14
ÑĨез         -0.14
loe          -0.14
Arbitrary    -0.13
Mare         -0.13
edor         -0.13
Marino       -0.13
Slot         -0.13
Positive Logits
Token        Logit
teil         0.16
eldorf       0.15
iej          0.15
GOODMAN      0.15
MainFrame    0.15
strup        0.14
Jed          0.14
.hm          0.14
Santa        0.14
unya         0.14
Activations Density 0.374%