INDEX
Explanations
lines or placeholders in forms or surveys
New Auto-Interp
Negative Logits
umann
-0.16
.fix
-0.15
skl
-0.15
aina
-0.14
ега
-0.14
??
-0.14
somehow
-0.14
onn
-0.14
akt
-0.14
Treat
-0.13
POSITIVE LOGITS
istrator
0.19
ober
0.18
../../../../
0.18
itud
0.17
GOODMAN
0.17
quarters
0.17
../../
0.16
%%%
0.16
urnal
0.15
.infinity
0.15
Activations Density 0.014%