INDEX
Explanations
references to Islamic entities or concepts
New Auto-Interp
Negative Logits
Islam
-0.21
Islam
-0.20
muslim
-0.17
Muslim
-0.17
Muslim
-0.17
Islamic
-0.17
Islamist
-0.16
Islamic
-0.16
Muslims
-0.16
iale
-0.16
POSITIVE LOGITS
ate
0.19
fundamental
0.18
ization
0.18
Relief
0.16
ÑĦÑĥн
0.16
isation
0.16
Republic
0.16
ized
0.15
fundament
0.15
State
0.15
Activations Density 0.006%