INDEX
    Explanations

    the word "moment."

    the presence of the substring "mo" in words

    New Auto-Interp
    Negative Logits
    Ĭ
    -0.64
     Ash
    -0.63
     forged
    -0.62
     Parenthood
    -0.61
     Dat
    -0.60
     belonging
    -0.59
     Paragu
    -0.59
     graves
    -0.59
     deficit
    -0.58
     sockets
    -0.58
    POSITIVE LOGITS
    mo
    4.57
    mos
    1.90
    MO
    1.90
    webkit
    1.74
    Mo
    1.59
     mo
    1.54
    mon
    1.33
    emo
    1.30
    mic
    1.29
    ma
    1.29
    Act Density 0.014%

    No Known Activations