INDEX
Explanations
phrases related to conflicts involving two opposing sides
references to opposing parties or factions involved in a conflict
New Auto-Interp
Negative Logits
pta
-0.74
nce
-0.73
Delivery
-0.60
ķ
-0.60
Rapids
-0.60
awar
-0.57
rupted
-0.57
Miko
-0.57
DERR
-0.57
venient
-0.56
POSITIVE LOGITS
alike
1.06
equally
1.00
mutually
0.97
simultaneously
0.96
sides
0.96
sexes
0.96
vying
0.88
emate
0.85
agree
0.85
'
0.83
Activations Density 0.175%