INDEX
Explanations
mentions of the extremist group "Boko Haram"
references to Boko Haram
Negative Logits
enegger       -0.78
less          -0.65
med           -0.64
drive         -0.64
orally        -0.63
eele          -0.63
Gutenberg     -0.63
ly            -0.63
nor           -0.62
Pathfinder    -0.61
Positive Logits
Haram         1.20
¯             0.96
¦             0.95
ħ             0.94
¶             0.94
©¶æ¥µ         0.91
ĺ             0.87
³             0.86
®             0.85
¹             0.85
Activations Density 0.018%