INDEX
Explanations
mentions of "sin" and related terms associated with wrongdoing or immorality
New Auto-Interp
Negative Logits
]--;
-1.03
,:);
-0.94
Parcelable
-0.93
nakalista
-0.92
Geplaatst
-0.92
myſelf
-0.91
himſelf
-0.91
Walkover
-0.90
engraçadas
-0.88
]]
-0.88
POSITIVE LOGITS
sin
2.13
Sin
1.97
sin
1.91
Sin
1.84
SIN
1.73
sins
1.63
SIN
1.49
Sins
1.30
sinful
1.22
sinned
1.21
Activations Density 0.081%