INDEX
    Explanations

    phrases that indicate personal responsibility or suggestions for action

    New Auto-Interp
    Negative Logits
    الحياه
    -0.83
    UnusedPrivate
    -0.54
     henkilö
    -0.52
     EconPapers
    -0.52
     cotone
    -0.51
    rzost
    -0.51
     Sabina
    -0.50
    الدراسه
    -0.49
     näytte
    -0.48
     tournant
    -0.48
    POSITIVE LOGITS
    Chooser
    0.69
     متعلقه
    0.67
     hurry
    0.60
    脚注の使い方
    0.58
     devriez
    0.56
     Pretty
    0.55
     PROCEED
    0.53
    mtd
    0.53
     widerrufen
    0.52
     should
    0.51
    Act Density 0.221%

    No Known Activations