INDEX
Explanations
mentions of locations or people's origins
occurrences of the term "native" in various contexts
New Auto-Interp
Negative Logits
ATA
-0.79
eper
-0.74
=-=-=-=-
-0.73
ATHER
-0.71
ENA
-0.70
ammy
-0.70
exha
-0.69
urat
-0.68
phasis
-0.68
uyomi
-0.68
POSITIVE LOGITS
native
1.02
native
0.97
oise
0.86
born
0.79
spe
0.77
americ
0.77
Instruments
0.76
Advertisement
0.75
itect
0.72
lucent
0.70
Activations Density 0.007%