INDEX
Explanations
phrases or concepts related to going "above and beyond."
New Auto-Interp
Head Attr Weights
0:0.01
1:0.02
2:0.12
3:0.29
4:0.01
5:0.01
6:0.09
7:0.05
8:0.09
9:0.12
10:0.04
11:0.09
Negative Logits
olkien
-1.30
clerosis
-1.26
Sweeney
-1.20
differently
-1.18
weeney
-1.17
sole
-1.15
bnb
-1.10
Genie
-1.06
Tea
-1.03
arios
-1.02
POSITIVE LOGITS
�
1.24
scenes
1.21
horizon
1.15
ctors
1.12
=-=-
1.11
orest
1.08
Scenes
1.06
Holo
1.01
parap
1.01
empt
1.01
Activations Density 0.014%