INDEX
    Explanations

    phrases related to legal issues or courtroom discussions

    New Auto-Interp
    Negative Logits
     change
    -1.20
    change
    -0.97
     switch
    -0.93
     CHANGE
    -0.90
     cambio
    -0.89
     Change
    -0.89
     shift
    -0.86
     changement
    -0.83
     changed
    -0.83
     changer
    -0.79
    POSITIVE LOGITS
     متعلقه
    0.83
     صوتيه
    0.75
    참고
    0.73
     الاطلاع
    0.72
    [toxicity=0]
    0.72
    findpost
    0.71
    出版年
    0.71
    +#+#
    0.70
     שוליים
    0.70
    Xna
    0.68
    Act Density 0.012%

    No Known Activations