INDEX
Explanations
personal opinions, particularly those expressing frustration or disagreement
emotional expressions and reactions
New Auto-Interp
Negative Logits
xtap
-0.81
etheless
-0.75
assetsadobe
-0.73
surprisingly
-0.70
ometimes
-0.67
uitive
-0.67
rition
-0.65
prisingly
-0.64
:=
-0.63
upon
-0.62
POSITIVE LOGITS
"]
1.84
',"
1.82
").
1.81
.")
1.75
")
1.72
"),
1.71
'"
1.70
,'"
1.68
)",
1.65
!".
1.63
Activations Density 1.029%