INDEX
Explanations
the presence of specific identifiers and contextual references
Head Attr Weights
0:0.02
1:0.03
2:0.12
3:0.04
4:0.03
5:0.06
6:0.10
7:0.30
8:0.11
9:0.03
10:0.07
11:0.04
Negative Logits
pend       -1.08
veland     -1.07
gee        -1.00
Provision  -0.97
Beg        -0.97
vette      -0.97
alore      -0.96
Koz        -0.94
bill       -0.93
oslav      -0.93
Positive Logits
rontal         1.34
terness        1.20
ONSORED        1.09
LESS           1.09
Metatron       1.05
theless        1.02
Beaut          1.01
swers          1.00
comple         0.98
largeDownload  0.96
Activations Density: 0.116%