Explanations
terms related to sectarianism and communal divisions
Negative Logits
cham         -0.80
ĸļ           -0.80
berman       -0.78
know         -0.75
ynthesis     -0.75
ODUCT        -0.74
cript        -0.72
erm          -0.71
AUT          -0.70
Prototype    -0.70
Positive Logits
strife       1.08
militias     1.03
ities        1.02
ism          1.02
warfare      0.97
affiliation  0.91
divides      0.90
violence     0.89
affili       0.88
relations    0.88
Activations Density: 0.008%