Explanations
instances of the prefix "un-" indicating negation or reversal of meaning
Negative Logits
/***/     -0.16
locked    -0.15
e         -0.15
anca      -0.15
packed    -0.14
zilla     -0.14
itime     -0.14
biased    -0.14
active    -0.14
ftime     -0.14
Positive Logits
amage     0.20
ilater    0.18
bid       0.17
icum      0.16
icolor    0.16
iere      0.15
ccess     0.15
igit      0.15
ря        0.15
ickers    0.14
Activations Density 0.030%