INDEX
Explanations
phrases related to specific effects or phenomena
references to various "effects" or phenomena described in social or psychological contexts
New Auto-Interp
Negative Logits
pigeon
-0.72
mbuds
-0.65
Methodist
-0.65
zar
-0.63
Timber
-0.63
Kats
-0.60
rooft
-0.60
Standards
-0.60
quarters
-0.59
Dud
-0.59
POSITIVE LOGITS
iveness
1.21
uated
1.16
ual
1.11
uating
1.05
ually
1.05
ively
1.04
uel
0.98
uate
0.98
bringer
0.98
uality
0.97
Activations Density 0.034%