Explanations
instances of the word "Islam" along with closely related discussions or sentiments
references to the term "Islam."
Negative Logits
Rober        -0.81
WD           -0.71
Seah         -0.69
EAR          -0.65
DEC          -0.65
asper        -0.65
TY           -0.64
EAR          -0.62
Rockefeller  -0.62
dain         -0.61
Positive Logits
ophobic      1.58
ophobia      1.52
ophob        1.20
abad         1.10
zai          0.95
ics          0.93
etics        0.91
icals        0.89
obia         0.89
ification    0.88
Activations Density 0.009%