INDEX
Explanations
mentions of Native American-related terms
references to Native American topics or issues
New Auto-Interp
Negative Logits
laden
-0.76
ammers
-0.74
zag
-0.71
ordon
-0.71
gloom
-0.70
fax
-0.69
leck
-0.68
hall
-0.68
dam
-0.68
Carlo
-0.68
POSITIVE LOGITS
Native
3.85
Native
3.02
Navajo
1.97
Indigenous
1.88
native
1.86
native
1.79
aboriginal
1.69
natives
1.66
indigenous
1.64
Aboriginal
1.63
Activations Density 0.022%