INDEX
Explanations
phrases indicating a high level of performance or quality
phrases indicating common experiences or situations related to societal norms and expectations
New Auto-Interp
Negative Logits
lav
-0.82
yond
-0.80
aba
-0.78
rig
-0.77
imar
-0.76
ipl
-0.74
azel
-0.74
amic
-0.74
ollo
-0.73
ulu
-0.73
POSITIVE LOGITS
athlet
0.69
offensively
0.66
applause
0.66
financially
0.66
academ
0.63
understatement
0.61
mascara
0.60
.
0.60
speculation
0.59
beaut
0.58
Activations Density 0.540%