Explanations
references to competition or conflict involving specific characters or events
Negative Logits
_consts      -0.15
acades       -0.15
£            -0.14
ordin        -0.14
uat          -0.14
aze          -0.14
eldom        -0.14
áž           -0.14
:CGRect      -0.13
libertine    -0.13
Positive Logits
adv          0.15
dit          0.15
again        0.15
мо           0.15
advance      0.14
quat         0.14
_again       0.14
tÃŃ          0.14
fi           0.13
typical      0.13
Activations Density: 0.003%