INDEX
Explanations
notable actions or events involving courage and heroism
New Auto-Interp
Head Attr Weights
0:0.09
1:0.02
2:0.05
3:0.03
4:0.05
5:0.03
6:0.25
7:0.05
8:0.07
9:0.28
10:0.02
11:0.03
Negative Logits
oleon
-4.23
Hugo
-3.88
Louie
-3.74
subst
-3.54
Yug
-3.53
rosis
-3.43
hypothal
-3.37
opath
-3.35
RO
-3.29
alg
-3.27
POSITIVE LOGITS
Bennett
11.70
ennett
7.43
Benn
5.64
Berman
5.39
Barrett
5.37
Boyd
5.17
Berry
4.99
Byrd
4.84
Banner
4.47
Barnett
4.45
Activations Density 0.004%