INDEX
Explanations
references to international relations and comparisons between countries
Follows country names
countries and nationalities
New Auto-Interp
Negative Logits
Europe
-0.96
Europe
-0.91
Europeans
-0.89
Eropa
-0.87
europe
-0.86
EUROPE
-0.81
Europa
-0.79
أوروبا
-0.77
España
-0.75
Europas
-0.74
POSITIVE LOGITS
ComVisible
0.70
Paglinawan
0.65
Bhutan
0.63
صوتيه
0.60
gyz
0.59
Diwedd
0.57
__':
0.57
ibouti
0.56
uilla
0.56
̍t
0.55
Activations Density 0.245%