INDEX
Explanations
proper nouns related to the Americas
words related to American identity or affiliations
New Auto-Interp
Negative Logits
fax
-0.97
sie
-0.69
igne
-0.69
Foods
-0.66
oping
-0.62
margin
-0.62
meat
-0.61
isers
-0.61
cember
-0.61
orders
-0.60
POSITIVE LOGITS
icans
1.01
ican
0.97
ICAN
0.95
ushi
0.87
icas
0.87
gency
0.85
ilar
0.80
ility
0.79
tu
0.79
gdala
0.77
Activations Density 0.076%