INDEX
Explanations
events or situations involving conflict or confrontation
instances of conflict or confrontation
New Auto-Interp
Negative Logits
mental
-0.88
employment
-0.86
duct
-0.78
estate
-0.78
intestinal
-0.76
enfranch
-0.76
Balt
-0.74
vation
-0.73
ãģĹ
-0.71
secut
-0.70
POSITIVE LOGITS
halla
1.09
roy
0.90
clash
0.83
between
0.81
looms
0.81
Royale
0.79
Ambro
0.78
Amon
0.78
clashes
0.73
Qiao
0.71
Activations Density 0.027%