INDEX
Explanations
references to social media links or content sharing
New Auto-Interp
Head Attr Weights
0:0.01
1:0.02
2:0.05
3:0.09
4:0.07
5:0.03
6:0.24
7:0.20
8:0.04
9:0.05
10:0.06
11:0.08
Negative Logits
fman
-1.74
Savannah
-1.44
soDeliveryDate
-1.33
Winc
-1.25
Gore
-1.23
ritch
-1.22
Canal
-1.22
ghan
-1.22
Ames
-1.22
dred
-1.21
POSITIVE LOGITS
exit
1.77
Status
1.45
guiActiveUnfocused
1.44
rocal
1.34
Response
1.28
�
1.27
etheless
1.26
arrival
1.24
ritical
1.23
riage
1.23
Activations Density 0.001%