INDEX
    Explanations

    mentions of "sin" and related terms associated with wrongdoing or immorality

    New Auto-Interp
    Negative Logits
    ]--;
    -1.03
    ,:);
    -0.94
     Parcelable
    -0.93
     nakalista
    -0.92
    Geplaatst
    -0.92
     myſelf
    -0.91
     himſelf
    -0.91
     Walkover
    -0.90
     engraçadas
    -0.88
    ]]
    
    -0.88
    POSITIVE LOGITS
     sin
    2.13
     Sin
    1.97
    sin
    1.91
    Sin
    1.84
     SIN
    1.73
     sins
    1.63
    SIN
    1.49
     Sins
    1.30
     sinful
    1.22
     sinned
    1.21
    Act Density 0.081%

    No Known Activations