INDEX
    Explanations

    conversational expressions conveying uncertainty or reluctance

    Negative Logits
    "erval"       -0.15
    " èIJ"        -0.15
    "Interval"    -0.15
    " Interval"   -0.15
    "ivar"        -0.14
    ".bt"         -0.14
    "anel"        -0.14
    " wow"        -0.14
    "wow"         -0.14
    "alth"        -0.13
    Positive Logits
    " who"        0.24
    " beg"        0.24
    " screw"      0.23
    "who"         0.22
    " fine"       0.22
    " thems"      0.20
    " shrugged"   0.20
    "fine"        0.19
    " Meh"        0.19
    " Fine"       0.18
    Activation Density 0.298%

    No Known Activations