INDEX
Explanations
words related to the religion Islam
mentions of the religion Islam
New Auto-Interp
Negative Logits
Rober
-0.76
WD
-0.71
Seah
-0.69
TY
-0.68
SPL
-0.68
EAR
-0.67
Member
-0.67
berries
-0.66
EAR
-0.65
asper
-0.63
POSITIVE LOGITS
ophobic
1.61
ophobia
1.56
ophob
1.25
abad
1.15
zai
0.96
ification
0.94
ics
0.94
icum
0.93
etics
0.91
ocide
0.91
Activations Density 0.017%