INDEX
Explanations
mentions of rebel groups or insurgency
New Auto-Interp
Negative Logits
ographies
-0.81
ocobo
-0.77
è¯
-0.74
uchin
-0.73
thora
-0.72
icrobial
-0.69
gow
-0.67
Safety
-0.67
Hilbert
-0.66
Robo
-0.66
POSITIVE LOGITS
factions
1.07
rebels
1.01
faction
0.95
militias
0.95
milit
0.94
rebel
0.93
fighters
0.92
stronghold
0.91
strongh
0.88
army
0.87
Activations Density 0.024%