INDEX
Explanations
mentions of the Protestant religion
references to Protestant denominations and related terms
New Auto-Interp
Negative Logits
adr
-0.80
amber
-0.76
enta
-0.76
phies
-0.76
ombie
-0.75
oha
-0.75
uable
-0.75
berman
-0.74
DonaldTrump
-0.74
ffen
-0.74
POSITIVE LOGITS
Crom
0.82
Episcopal
0.79
isms
0.76
ism
0.75
SourceFile
0.74
esses
0.74
LIN
0.71
ness
0.70
nian
0.68
Methodist
0.68
Activations Density 0.042%