INDEX
Explanations
references to mosques
references to mosques
New Auto-Interp
Negative Logits
dit
-0.83
lasses
-0.78
lust
-0.77
å¦
-0.76
ANC
-0.75
short
-0.72
AW
-0.71
bart
-0.69
lass
-0.69
cold
-0.68
POSITIVE LOGITS
mosque
1.19
mosques
1.10
Mosque
1.05
abad
0.98
mosqu
0.97
hammad
0.94
loudspe
0.92
istani
0.89
synagogue
0.84
cleric
0.82
Activations Density 0.012%