INDEX
Explanations
references to being native to a place or language
references to indigenous or native communities and their issues
New Auto-Interp
Negative Logits
=-=-=-=-
-0.76
ammy
-0.75
awed
-0.75
urat
-0.73
apter
-0.72
enario
-0.72
apego
-0.70
udence
-0.70
ridor
-0.69
ennes
-0.69
POSITIVE LOGITS
americ
1.00
born
0.83
Advertisement
0.82
spe
0.79
Instruments
0.77
izations
0.75
tongue
0.75
ivated
0.72
izer
0.70
Hawai
0.69
Activations Density 0.029%