INDEX
Explanations
references to secular concepts or entities
occurrences and discussions of secularism
Negative Logits
è¦ļéĨĴ              -0.84
externalActionCode  -0.79
GAME                -0.77
hov                 -0.73
upon                -0.73
CAP                 -0.69
amaz                -0.69
Downloadha          -0.67
Phones              -0.67
Lans                -0.66
Positive Logits
ization     1.13
stagnation  1.08
ized        1.07
ity         1.07
ists        1.05
ism         1.04
tarian      1.02
izing       1.00
icals       0.98
ised        0.97
Activations Density 0.013%