INDEX
Explanations
references to boundaries and related political terms
references to geographic boundaries or divisions
New Auto-Interp
Negative Logits
hiba
-0.90
eah
-0.89
ems
-0.88
orah
-0.87
ety
-0.82
cffffcc
-0.82
efully
-0.81
etooth
-0.80
heed
-0.80
ebin
-0.78
POSITIVE LOGITS
Kane
0.90
VO
0.66
Allan
0.66
Lauder
0.65
Hebdo
0.65
Ware
0.64
Dame
0.63
Clear
0.63
naire
0.63
Wast
0.62
Activations Density 0.123%