INDEX
Explanations
terms related to religious beliefs and practices
religious beliefs and practices
New Auto-Interp
Negative Logits
chrétiens
-0.63
Reformed
-0.60
Shalom
-0.59
CHRISTIAN
-0.59
cristianos
-0.56
chrétien
-0.56
cristiana
-0.55
Christians
-0.54
Christians
-0.54
espirituales
-0.53
POSITIVE LOGITS
OGND
0.54
shit
0.43
fuckin
0.37
spira
0.36
extrem
0.36
__((
0.36
coo
0.35
cr
0.35
fucking
0.35
اند
0.34
Activations Density 0.070%