INDEX
Explanations
phrases that convey a sense of nightmare or negative scenarios
New Auto-Interp
Head Attr Weights
0:0.02
1:0.01
2:0.07
3:0.05
4:0.13
5:0.03
6:0.04
7:0.37
8:0.02
9:0.03
10:0.08
11:0.10
Negative Logits
emphasis
-1.70
quickShipAvailable
-1.59
Interested
-1.56
aples
-1.55
est
-1.47
neutral
-1.45
baugh
-1.42
hement
-1.42
POSE
-1.39
observations
-1.36
POSITIVE LOGITS
neighb
1.48
��
1.45
��
1.42
Zucker
1.42
babys
1.42
skelet
1.31
�
1.31
Thro
1.30
importing
1.29
bureaucracy
1.28
Activations Density 0.001%