Explanations
mentions of domestic-related terms or issues
references to "domestic" issues or contexts
Negative Logits
Token      Logit
uden       -0.96
uyomi      -0.93
*/(        -0.83
umper      -0.82
ombs       -0.78
osen       -0.78
edin       -0.75
kson       -0.74
GOODMAN    -0.74
atto       -0.74
Positive Logits
Token        Logit
Violence     1.02
affairs      0.94
violence     0.93
tranqu       0.86
appliances   0.81
violence     0.79
combustion   0.77
terrorism    0.76
estic        0.75
ility        0.73
Activations Density: 0.023%