INDEX
Explanations
phrases related to meeting or not meeting expectations
phrases related to expectations and their fulfillment
New Auto-Interp
Negative Logits
contrace
-0.74
etsk
-0.67
İĭ
-0.66
smoking
-0.62
fusc
-0.61
flush
-0.58
aves
-0.57
ramer
-0.57
navig
-0.57
DOWN
-0.57
POSITIVE LOGITS
edly
0.71
adolesc
0.69
ouse
0.64
iously
0.62
IGHTS
0.62
gypt
0.61
Marina
0.61
uations
0.60
amia
0.59
iments
0.59
Activations Density 0.068%