INDEX
Explanations
references to significant statistical data or figures
New Auto-Interp
Head Attr Weights
0:0.07
1:0.06
2:0.09
3:0.07
4:0.08
5:0.08
6:0.08
7:0.08
8:0.08
9:0.08
10:0.10
11:0.07
Negative Logits
hooks
-1.70
fab
-1.55
tricked
-1.55
Detect
-1.54
mouse
-1.54
detects
-1.45
fusion
-1.45
coax
-1.45
bait
-1.45
hoped
-1.44
POSITIVE LOGITS
lections
1.72
VERTISEMENT
1.70
lesiastical
1.70
ittal
1.66
thood
1.66
aucuses
1.66
outwe
1.63
disqualified
1.63
disqual
1.62
auga
1.62
Activations Density 0.000%