Explanations
mentions of the Muslim community
mentions of the Muslim community and related themes
Negative Logits
Token         Logit
amina         -0.90
æ©            -0.84
ructure       -0.83
WD            -0.80
Rockefeller   -0.78
angled        -0.76
CLA           -0.75
CAP           -0.75
dated         -0.73
ATA           -0.72
Positive Logits
Token         Logit
istani        1.09
cleric        0.95
shooters      0.92
hammad        0.91
Muslim        0.90
immigrant     0.89
Muslims       0.89
clerics       0.88
ophobic       0.87
hijab         0.85
Activations Density: 0.010%