Explanations
adjectives related to nationalities
references to nationalities and ethnicities
Negative Logits
ueller       -0.93
uder         -0.88
nyder        -0.86
nels         -0.84
utherford    -0.84
ertodd       -0.83
lease        -0.82
vable        -0.82
umbnails     -0.82
uer          -0.81
Positive Logits
oslov        0.96
cuisine      0.95
Nadu         0.90
accent       0.86
Portuguese   0.84
istani       0.84
mystic       0.84
monk         0.83
nationals    0.82
Orthodox     0.82
Activations Density: 0.154%