INDEX
Explanations
mentions of European nationalities
references to specific nationalities or ethnicities
Negative Logits
odder        -0.86
icago        -0.85
nyder        -0.85
utherford    -0.84
ertodd       -0.83
affles       -0.81
mble         -0.81
iscons       -0.79
ividual      -0.79
uder         -0.78
Positive Logits
oslov         0.98
Nadu          0.89
nationals     0.88
shepherd      0.84
translation   0.83
cuisine       0.82
accent        0.81
proverb       0.81
Portuguese    0.79
istani        0.78
Activations Density 0.139%