INDEX
Explanations
religious terminology or discussions related to religion
references to religion
New Auto-Interp
Negative Logits
20439
-0.76
Lans
-0.70
msg
-0.69
Yon
-0.69
astern
-0.68
sg
-0.67
berry
-0.66
amaz
-0.66
ptives
-0.65
Packers
-0.64
POSITIVE LOGITS
religion
0.93
ophobia
0.84
worshipped
0.80
affiliation
0.80
scriptures
0.78
zai
0.75
fundamentalist
0.74
antry
0.73
ophobic
0.73
igion
0.73
Activations Density 0.009%