Explanations
proper nouns of different nationalities and ethnicities
references to specific nationalities or ethnicities
Negative Logits
Token          Logit
oral           -0.71
olate          -0.71
headquarters   -0.65
appa           -0.64
Kraft          -0.63
ms             -0.63
ED             -0.63
Downtown       -0.62
UM             -0.62
ija            -0.62
Positive Logits
Token          Logit
Spani          3.53
Frenchman      2.40
Swed           1.83
Scots          1.66
Californ       1.45
Britons        1.45
Dane           1.43
Italians       1.43
Scot           1.37
Pole           1.36
Activations Density: 0.047%