INDEX
Explanations
references to specific geographic locations and capitals
New Auto-Interp
Negative Logits
Croatia
-0.16
utz
-0.16
ocal
-0.15
Yugoslavia
-0.15
Neville
-0.14
Catalonia
-0.14
orthy
-0.14
Louisiana
-0.14
Descriptor
-0.14
Ventura
-0.14
POSITIVE LOGITS
Islamabad
0.28
capitals
0.23
Ankara
0.23
Kabul
0.22
capital
0.22
Moscow
0.20
Pret
0.20
Capitals
0.20
Jakarta
0.19
Canberra
0.19
Activations Density 0.185%