INDEX
Explanations
references to storytelling and narrative elements
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.12
3:0.09
4:0.13
5:0.03
6:0.05
7:0.34
8:0.02
9:0.03
10:0.05
11:0.04
Negative Logits
ワ
-1.67
speeds
-1.62
Zup
-1.57
Newport
-1.52
ISO
-1.52
Elections
-1.51
obal
-1.43
vironments
-1.42
distances
-1.39
cloaked
-1.38
POSITIVE LOGITS
precedent
1.67
trend
1.62
backer
1.59
dogs
1.58
erva
1.58
favourites
1.57
disappoint
1.57
favorites
1.56
widget
1.52
filler
1.52
Activations Density 0.016%