Explanations
emotional expressions and sentiments related to loss and tribute
Head Attr Weights
0:0.06
1:0.04
2:0.07
3:0.04
4:0.06
5:0.04
6:0.22
7:0.06
8:0.10
9:0.19
10:0.03
11:0.04
Negative Logits
heit         -3.61
Sok          -3.57
antz         -3.51
vec          -3.46
cho          -3.45
neuron       -3.30
Kling        -3.26
Hispanics    -3.24
Kov          -3.22
ć            -3.22
Positive Logits
Royal        8.83
Royal        8.71
royal        7.07
roy          5.63
roy          5.50
Prin         4.95
monarchy     4.94
Queen        4.51
Princess     4.45
Queen        4.32
Activations Density 0.002%