INDEX
Explanations
references to specific district designations or classifications
New Auto-Interp
Negative Logits
ragon
-0.19
aten
-0.18
loy
-0.17
imi
-0.16
isy
-0.16
Sirius
-0.15
ark
-0.15
ler
-0.15
own
-0.15
warf
-0.14
POSITIVE LOGITS
illard
0.19
elsea
0.18
etro
0.17
yer
0.16
USD
0.16
eded
0.16
enny
0.16
/fw
0.16
acus
0.15
resher
0.15
Activations Density 0.028%