INDEX
Explanations
various forms of the word "marginal."
Head Attr Weights
0:0.03
1:0.03
2:0.04
3:0.06
4:0.05
5:0.05
6:0.40
7:0.05
8:0.04
9:0.07
10:0.07
11:0.05
Negative Logits
erness         -1.37
Downloadha     -1.26
captcha        -1.26
vision         -1.25
prospect       -1.23
largeDownload  -1.21
stown          -1.20
imaru          -1.16
undred         -1.15
ouver          -1.15
Positive Logits
ciation    1.35
assi       1.30
————       1.26
Tsukuyomi  1.23
osi        1.23
OTAL       1.22
ophe       1.20
ée         1.19
vich       1.18
————————   1.17
Activations Density 0.004%