INDEX
Explanations
negative emotions and situations such as struggling, hating, feeling horrible, and being sentenced
instances of emotional distress or struggle
New Auto-Interp
Negative Logits
UF
-0.69
raft
-0.69
ourse
-0.68
SpaceEngineers
-0.65
cott
-0.64
concess
-0.63
urrent
-0.63
conom
-0.63
Claim
-0.63
asury
-0.62
POSITIVE LOGITS
huh
1.17
haha
1.11
yeah
1.02
but
1.00
eh
0.99
oh
0.96
albeit
0.93
though
0.92
blah
0.91
maybe
0.88
Activations Density 0.535%