INDEX
Explanations
elements related to conflict and sacrifice in narratives
New Auto-Interp
Head Attr Weights
0:0.06
1:0.12
2:0.19
3:0.04
4:0.04
5:0.02
6:0.04
7:0.31
8:0.02
9:0.03
10:0.06
11:0.03
Negative Logits
Fresno
-1.97
Blanc
-1.86
nineteenth
-1.82
cheering
-1.80
fo
-1.79
erey
-1.77
eenth
-1.76
arf
-1.74
protested
-1.72
Falk
-1.71
POSITIVE LOGITS
cknow
1.97
odon
1.95
˜
1.86
CDC
1.84
designers
1.83
ody
1.81
Design
1.80
OTOS
1.79
otom
1.79
Detect
1.79
Activations Density 0.017%