INDEX
Explanations
names or terms related to people of Asian descent
New Auto-Interp
Negative Logits
Berks
-0.97
Harris
-0.86
meteor
-0.83
Mars
-0.74
Schr
-0.74
Philadelphia
-0.73
rolet
-0.70
Sch
-0.70
Ram
-0.69
comet
-0.66
POSITIVE LOGITS
Tong
4.04
tong
1.66
Wong
1.48
Fiji
1.41
tongues
1.32
Samoa
1.30
Yong
1.25
Brune
1.11
Gou
1.09
Mold
1.08
Activations Density 0.053%