INDEX
Explanations
mentions of the word "domestic"
references to domestic issues or topics
New Auto-Interp
Negative Logits
uden
-0.89
uyomi
-0.84
umper
-0.81
ombs
-0.79
*/(
-0.79
osen
-0.77
edin
-0.74
kson
-0.71
atto
-0.70
GOODMAN
-0.70
POSITIVE LOGITS
Violence
1.00
affairs
0.93
violence
0.86
Domestic
0.79
violence
0.78
appliances
0.76
domestic
0.76
tranqu
0.75
combustion
0.74
law
0.73
Activations Density 0.011%