INDEX
Explanations
references to United States history and politics
references to American history or culture
New Auto-Interp
Negative Logits
fielder
-0.80
ouf
-0.77
Dalai
-0.70
inhibitor
-0.70
uilt
-0.69
viks
-0.68
rogens
-0.64
Duchess
-0.63
Hour
-0.62
Mermaid
-0.62
POSITIVE LOGITS
society
1.33
politics
1.33
history
1.19
affairs
1.09
capitalism
1.08
folklore
1.08
circles
1.08
academia
1.07
culture
1.06
civilization
1.03
Activations Density 0.269%