INDEX
Explanations
countries
references to specific nationalities or ethnic groups
New Auto-Interp
Negative Logits
rex
-0.97
apego
-0.93
ifice
-0.92
utics
-0.84
illance
-0.84
ologies
-0.83
utherford
-0.81
ivism
-0.81
anship
-0.76
pter
-0.76
POSITIVE LOGITS
istani
0.97
Portuguese
0.88
citiz
0.88
Nadu
0.85
nationals
0.77
entertain
0.77
civil
0.75
immigrant
0.75
embassies
0.74
intellig
0.73
Activations Density 0.035%