INDEX
Explanations
references to capital cities and their specific locations
Negative Logits
Kerala         -0.16
Maharashtra    -0.15
Neville        -0.15
Arkansas       -0.15
Croatia        -0.15
oulouse        -0.14
opes           -0.14
Thailand       -0.14
Louisiana      -0.14
CLE            -0.14
Positive Logits
Islamabad      0.23
Bog            0.20
Acc            0.20
Pret           0.20
capital        0.20
Alg            0.20
Add            0.19
Mog            0.18
Dak            0.18
Rab            0.17
Activations Density 0.184%