INDEX
    Explanations

    acknowledging user's underlying intent

    New Auto-Interp
    Negative Logits
     הח
    0.61
     Always
    0.59
     Features
    0.58
     souvent
    0.58
     тип
    0.58
     Lorsque
    0.57
    вніш
    0.57
     завжди
    0.56
     vaak
    0.56
    組み合わせ
    0.56
    POSITIVE LOGITS
     gauging
    0.80
     assessing
    0.79
     shouldn
    0.78
     congrat
    0.76
     questioning
    0.76
     objectively
    0.75
     acknowledging
    0.74
     vetting
    0.73
     receiving
    0.73
     alleviating
    0.73
    Act Density 0.090%

    No Known Activations