INDEX
Explanations
references to religious organizations and their activities
New Auto-Interp
Negative Logits
uck
-0.19
Ī
-0.15
eskort
-0.14
mmas
-0.14
ogh
-0.14
Wales
-0.14
azz
-0.14
ãģ¡ãģ¯
-0.14
nackte
-0.14
Tib
-0.14
POSITIVE LOGITS
Luther
0.30
Lutheran
0.27
syn
0.21
LC
0.21
Concord
0.21
Missouri
0.20
confess
0.20
Aug
0.20
EL
0.20
лÑİ
0.20
Activations Density 0.020%