INDEX
    Explanations

    phonetic mismatches

    New Auto-Interp
    Negative Logits
    ©¶æ
    -0.88
     beneficiary
    -0.70
    benef
    -0.68
    figured
    -0.67
     unsuspecting
    -0.64
    pelling
    -0.63
     dart
    -0.63
    ulnerable
    -0.63
     darts
    -0.63
    catch
    -0.63
    POSITIVE LOGITS
    atted
    0.84
    ebook
    0.78
    enez
    0.75
    TN
    0.74
    hetti
    0.73
    auga
    0.73
    ioch
    0.73
    ettes
    0.73
    ione
    0.72
    ICAN
    0.71
    Act Density 0.018%

    No Known Activations