INDEX
Explanations
instances of dialogue or quotes in the text
Head Attr Weights
0:0.04
1:0.02
2:0.09
3:0.12
4:0.07
5:0.05
6:0.05
7:0.21
8:0.05
9:0.07
10:0.10
11:0.08
Negative Logits
urat:-1.65
otherapy:-1.63
aci:-1.50
mur:-1.49
flix:-1.49
issan:-1.48
simplicity:-1.46
UFC:-1.45
evasion:-1.44
brance:-1.44
Positive Logits
soever:1.82
Attempt:1.58
Coverage:1.58
Latest:1.57
Hours:1.56
taboola:1.54
Initial:1.52
Visit:1.48
loads:1.44
Times:1.44
Activations Density: 0.001%