INDEX
Explanations
themes related to expressing frustration or resignation
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.08
3:0.06
4:0.14
5:0.03
6:0.08
7:0.34
8:0.03
9:0.04
10:0.07
11:0.05
Negative Logits
aughtered
-1.67
FORMATION
-1.62
tumblr
-1.58
代
-1.56
rive
-1.52
erial
-1.49
pict
-1.49
Skill
-1.44
converter
-1.44
SPONSORED
-1.41
POSITIVE LOGITS
politely
1.94
nervously
1.91
cynicism
1.80
gladly
1.77
scorn
1.75
gloom
1.72
goodbye
1.68
happily
1.68
indifference
1.68
emphatically
1.67
Activations Density 0.001%