Explanations
phrases expressing frustration and anger
emotional expressions and reactions
Negative Logits
Token         Logit
etheless      -0.86
xtap          -0.81
upon          -0.72
prisingly     -0.71
ometimes      -0.70
surprisingly  -0.70
ItemImage     -0.65
uitive        -0.64
:=            -0.64
mittedly      -0.63
Positive Logits
Token   Logit
',"     1.81
!'"     1.77
'."     1.76
'"      1.68
.")     1.67
,'"     1.66
.'"     1.61
").     1.59
?'"     1.58
').     1.52
Activations Density 0.880%