INDEX
Explanations
specific numeric values or measurements related to quantity and size
Negative Logits
oblin      -0.17
ILT        -0.16
opper      -0.15
502        -0.15
ока        -0.15
Gir        -0.14
_kill      -0.14
Harr       -0.14
opr        -0.14
brain      -0.14
Positive Logits
egen        0.16
Rew         0.16
Wor         0.15
rew         0.15
/xhtml      0.14
Rew         0.14
Stevens     0.14
anker       0.14
departure   0.14
aker        0.14
Activations Density: 0.024%