INDEX
Explanations
references to religious beliefs and practices
New Auto-Interp
Negative Logits
оваÑĢ
-0.16
HasBeen
-0.15
ursed
-0.15
ahas
-0.15
isse
-0.15
addCriterion
-0.15
raÄį
-0.15
zas
-0.14
ronym
-0.14
relude
-0.14
POSITIVE LOGITS
conversion
0.30
religion
0.30
converted
0.26
faith
0.26
Conversion
0.25
conversions
0.24
çļ
0.24
convert
0.24
Christianity
0.24
converting
0.23
Activations Density 0.174%