INDEX
Explanations
negative emotions and sentiments related to guilt, sadness, and insecurity
expressions of negative emotions and states of mind
New Auto-Interp
Negative Logits
uria
-0.70
defic
-0.65
arcity
-0.60
Tobacco
-0.60
eatures
-0.60
Explore
-0.58
eu
-0.58
Provided
-0.57
utions
-0.57
vantage
-0.57
POSITIVE LOGITS
ABOUT
1.25
about
1.25
bc
1.06
wondering
1.04
thinking
0.96
About
0.96
imagining
0.94
about
0.94
watching
0.91
because
0.88
Activations Density 0.265%