INDEX
    Explanations

    phrases indicating uncertainty or questioning

    New Auto-Interp
    Negative Logits
    <%=
    -0.51
    ว์
    -0.51
    iecie
    -0.51
     đại
    -0.51
    ศาสตร์
    -0.50
     Peters
    -0.50
     P
    -0.48
    IMDG
    -0.48
    mezzo
    -0.48
    дые
    -0.48
    POSITIVE LOGITS
     frankly
    0.93
     honestly
    0.86
    Honestly
    0.86
     Honestly
    0.80
    honestly
    0.77
    Frankly
    0.76
    really
    0.74
     really
    0.73
    ScopeManager
    0.73
     admit
    0.71
    Act Density 0.127%

    No Known Activations