INDEX
Explanations
references to specific characters or names
Negative Logits
laureate    -0.63
FACE        -0.62
weight      -0.62
poppy       -0.61
Winchester  -0.60
oire        -0.60
Tempest     -0.59
FORMATION   -0.59
dividend    -0.59
ORGE        -0.58
Positive Logits
umar        1.31
unin        1.28
ansas       1.24
ernel       1.21
nown        1.18
rish        1.17
htar        1.08
ileaks      1.06
owski       1.04
arak        1.03
Activations Density: 0.672%