INDEX
Explanations
words related to physical locations or references to "in."
Head Attr Weights
0:0.09
1:0.02
2:0.08
3:0.05
4:0.13
5:0.13
6:0.03
7:0.02
8:0.16
9:0.16
10:0.06
11:0.02
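
The listing above does not say how the weights are computed or normalized, so the following is only a minimal sketch for reading the `head:weight` pairs as shown and ranking the heads. The helper names `parse_head_weights` and `top_heads` are hypothetical and not part of any tool referenced here.

```python
# Head attribution weights copied verbatim from the listing above ("head:weight").
RAW = "0:0.09 1:0.02 2:0.08 3:0.05 4:0.13 5:0.13 6:0.03 7:0.02 8:0.16 9:0.16 10:0.06 11:0.02"


def parse_head_weights(raw):
    """Parse whitespace-separated 'head:weight' items into {head_index: weight}."""
    weights = {}
    for item in raw.split():
        head, weight = item.split(":")
        weights[int(head)] = float(weight)
    return weights


def top_heads(weights, k=2):
    """Return the k head indices with the largest weight (ties go to the lower index)."""
    return sorted(weights, key=lambda h: (-weights[h], h))[:k]


weights = parse_head_weights(RAW)
total = sum(weights.values())   # the displayed values are rounded to two decimals
leading = top_heads(weights, 4)
print(leading, round(total, 2))
```

On these values, heads 8 and 9 carry the largest weights, followed by heads 4 and 5. The displayed weights add up to 0.95 rather than 1, which is consistent with per-value rounding, though the listing itself does not confirm that the weights are meant to sum to 1.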
Negative Logits
suspic          -1.30
traged          -1.21
Seym            -1.08
enthusi         -1.06
hail            -1.02
ideos           -1.00
wcs             -0.99
conscientious   -0.97
icter           -0.96
distur          -0.96
Positive Logits
ahime           1.35
ryu             1.31
etus            1.19
usters          1.12
ling            1.11
tein            1.11
gram            1.10
hod             1.09
iao             1.08
istics          1.06
Activations Density 0.047%
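
To give the density figure a sense of scale, here is a small worked calculation. It assumes that "Activations Density" means the fraction of tokens on which the feature has a nonzero activation; the listing does not define the term, so treat that reading as an assumption. The function name `expected_activations` is hypothetical.

```python
# Density as shown in the listing, in percent.
DENSITY_PERCENT = 0.047


def expected_activations(num_tokens, density_percent=DENSITY_PERCENT):
    """Expected number of activating tokens, ASSUMING density = fraction of tokens that fire."""
    return num_tokens * density_percent / 100.0


# Under that assumption, about 470 tokens per million would activate the feature.
per_million = round(expected_activations(1_000_000))
print(per_million)
```

Under this reading the feature is sparse: it would fire on roughly one token in every 2,100.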