INDEX
Explanations
mentions of the continent Europe
references to Europe
New Auto-Interp
Negative Logits
anan
-0.78
oru
-0.76
gur
-0.75
uilt
-0.74
acca
-0.74
yrights
-0.73
abee
-0.72
yright
-0.72
perse
-0.71
ibility
-0.71
POSITIVE LOGITS
Parliament
0.85
Union
0.77
Galile
0.76
countries
0.73
ophobia
0.73
continent
0.72
nations
0.72
Continent
0.71
Europe
0.71
ffen
0.68
Activations Density 0.016%