INDEX
Explanations
mentions of the United States of America or things related to America
instances of the word "American."
New Auto-Interp
Negative Logits
fill
-0.76
fm
-0.74
Fill
-0.72
theless
-0.69
order
-0.69
ogyn
-0.69
COLOR
-0.68
ologically
-0.67
ĵĺ
-0.67
spring
-0.66
POSITIVE LOGITS
Embassy
1.01
embassy
1.00
Samoa
0.96
Airlines
0.95
embassies
0.90
Expedition
0.87
consulate
0.86
diplomats
0.84
ambassador
0.82
exile
0.81
Activations Density 0.039%