INDEX
Explanations
terms related to emotional and moral concepts of belief and intention
Negative Logits
Token      Logit
Thirty     -0.89
ULAR       -0.79
USS        -0.75
YR         -0.74
Recomm     -0.73
ãĥĥãĥī     -0.71
Specific   -0.70
URA        -0.67
Stars      -0.66
trak       -0.66
Positive Logits
Token        Logit
edly         1.09
syndrome     0.86
nered        0.85
affair       0.84
prevailed    0.75
ned          0.74
attitude     0.70
impression   0.70
excuse       0.68
prevail      0.67
Activations Density 0.050%
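
The token `ãĥĥãĥī` in the negative logits list looks like a raw byte-level BPE vocabulary entry rather than readable text. The sketch below assumes the model uses a GPT-2-style byte-to-character table (the dashboard does not state which tokenizer is in use) and maps such a string back to UTF-8 text. The function names `bytes_to_unicode` and `decode_token` are illustrative and not part of any tool referenced here.

```python
def bytes_to_unicode():
    """Build the byte -> printable-character table used by GPT-2-style byte-level BPE.

    Assumption: the tokenizer behind this dashboard follows this scheme.
    """
    # Bytes that are already printable keep their own code point.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


def decode_token(token: str) -> str:
    """Map a vocabulary string back to raw bytes, then decode as UTF-8."""
    char_to_byte = {c: b for b, c in bytes_to_unicode().items()}
    raw = bytes(char_to_byte[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĥĥãĥī"))  # → ッド
```

Under that assumption the entry is the katakana fragment `ッド`, so it is a legitimate token and not extraction damage.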