INDEX
Explanations
references to Muslims and related cultural or religious terms and events
New Auto-Interp
Negative Logits
psz
-0.56
vang
-0.54
Ife
-0.51
Tartu
-0.51
Ams
-0.49
ylen
-0.49
Welsh
-0.49
Kla
-0.49
Labrador
-0.48
Aval
-0.48
POSITIVE LOGITS
Infórmanos
0.92
mosques
0.89
ArrowToggle
0.89
Islam
0.88
مشين
0.87
mosque
0.87
Mohammed
0.87
Mosque
0.85
Mohammed
0.85
AsUp
0.83
Activations Density 0.340%