Explanations
references to actions or situations pertaining to peace or peaceful resolutions
references to peace and peacefulness
Negative Logits
Token      Logit
MAC        -0.91
attr       -0.80
olog       -0.78
WD         -0.76
VM         -0.74
Sales      -0.74
ANA        -0.73
drivers    -0.72
odor       -0.71
aan        -0.70
Positive Logits
Token         Logit
peaceful      1.25
peacefully    1.01
agre          0.87
peace         0.86
nonviolent    0.83
edIn          0.81
treaty        0.78
peace         0.78
minded        0.78
bystand       0.78
Activations Density: 0.006%
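
As a rough illustration of what a density figure like this can mean, the sketch below computes the fraction of tokens on which a feature fires. It assumes that "density" is the share of sampled tokens with an activation above zero; the page itself does not state the definition, and the function names and threshold parameter are hypothetical, not taken from any dashboard API.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density is defined as (active tokens) / (all sampled tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


def as_percent(fraction: float, digits: int = 3) -> str:
    """Format a fraction as a percentage string with `digits` decimal places."""
    return f"{fraction * 100:.{digits}f}%"


# Toy data: 6 active tokens out of 100,000 sampled tokens.
sample = [0.0] * 99_994 + [2.5] * 6
density = activation_density(sample)
print(as_percent(density))
```

Under that assumed definition, a density of 0.006% would correspond to about 6 active tokens per 100,000, which is what the toy data above reproduces.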