INDEX
Explanations
assertions of capability or potential in various contexts
New Auto-Interp
Head Attr Weights
0:0.07
1:0.05
2:0.03
3:0.08
4:0.08
5:0.06
6:0.05
7:0.03
8:0.25
9:0.21
10:0.01
11:0.03
Negative Logits
spiked
-1.94
isexual
-1.91
bryce
-1.78
proport
-1.72
laced
-1.71
scorp
-1.62
田
-1.62
fur
-1.60
��
-1.59
%%
-1.58
POSITIVE LOGITS
icent
1.78
quickShipAvailable
1.77
ELL
1.75
learn
1.73
eenth
1.70
minster
1.67
success
1.64
assetsadobe
1.62
Secondary
1.61
SSL
1.58
Activations Density 0.001%