INDEX
Explanations
mentions of locations or directions indicating a physical place
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.15
3:0.07
4:0.21
5:0.09
6:0.03
7:0.02
8:0.06
9:0.16
10:0.05
11:0.03
Negative Logits
��
-1.53
Downloadha
-1.36
leases
-1.35
etheless
-1.35
��
-1.31
etimes
-1.29
��
-1.26
pload
-1.25
enh
-1.23
SN
-1.20
POSITIVE LOGITS
sci
1.51
imester
1.47
thouse
1.39
rimination
1.35
agascar
1.30
#$
1.27
�
1.27
INTON
1.26
inian
1.23
gasp
1.22
Activations Density 0.007%