INDEX
Explanations
mentions of the word "Story"
New Auto-Interp
Head Attr Weights
0:0.06
1:0.10
2:0.07
3:0.08
4:0.06
5:0.08
6:0.08
7:0.06
8:0.08
9:0.09
10:0.09
11:0.10
Negative Logits
Regional
-1.67
Investig
-1.65
Letters
-1.64
Born
-1.61
Research
-1.61
Abu
-1.60
Concern
-1.58
Investigations
-1.57
Organisation
-1.57
Latest
-1.56
POSITIVE LOGITS
utherland
1.97
icing
1.96
zbollah
1.76
kins
1.72
wives
1.71
CVE
1.70
muff
1.69
lime
1.66
myra
1.65
ivas
1.64
Activations Density 0.000%