INDEX
    Explanations

    phrases related to controversy or conflict

    phrases indicating moral dilemmas or ethical conflicts

    New Auto-Interp
    Negative Logits
    ahu
    -0.74
    utral
    -0.67
     Mechdragon
    -0.66
    iple
    -0.63
    USS
    -0.61
    Js
    -0.58
    é¾
    -0.58
    Wire
    -0.57
    ¬¼
    -0.57
     corrid
    -0.57
    POSITIVE LOGITS
     namely
    1.22
     albeit
    1.08
     despite
    0.94
     viz
    0.93
     nor
    0.92
     especially
    0.91
     irrespective
    0.91
     regardless
    0.90
     lest
    0.90
    especially
    0.88
    Act Density 0.386%

    No Known Activations