INDEX
Explanations
terrorism and terrorist groups
New Auto-Interp
Negative Logits
ка
1.41
ри
1.25
ни
1.13
т
1.11
ει
1.10
و
1.00
ו
0.98
י
0.92
ي
0.91
ના
0.89
POSITIVE LOGITS
terrorism
1.07
1.07
terrorist
1.06
terrorists
0.85
Terrorism
0.85
জঙ্গি
0.84
9
0.80
Terror
0.76
terrorism
0.75
Terror
0.75
Activations Density 0.001%