INDEX
    Explanations

    affirmative responses indicating agreement or confirmation

    New Auto-Interp
    Negative Logits
    ibri
    -0.16
    ãĥªãĥ¼ãĤº
    -0.15
    Ļ
    -0.15
    otto
    -0.15
    reon
    -0.14
    .Typed
    -0.14
    ehr
    -0.14
    ibus
    -0.14
    tel
    -0.14
    xies
    -0.14
    POSITIVE LOGITS
    optera
    0.17
    InRange
    0.16
    zá
    0.15
    ény
    0.15
    iqu
    0.14
    :convert
    0.14
    bil
    0.13
     thang
    0.13
    letic
    0.13
    andr
    0.13
    Act Density 0.016%

    No Known Activations