INDEX
Explanations
emotional states and expressions of well-being
New Auto-Interp
Negative Logits
Halk
-0.15
adm
-0.15
qli
-0.15
obs
-0.15
tear
-0.15
mess
-0.14
Brun
-0.14
Mess
-0.14
RICS
-0.13
μβ
-0.13
POSITIVE LOGITS
ä¾
0.18
ownership
0.17
thouse
0.17
ownership
0.16
Ownership
0.16
vÄĽd
0.16
/Dk
0.16
threat
0.15
IFn
0.15
çµ
0.15
Activations Density 0.067%