    Explanations

    verbs indicating contemplation or consideration

    phrases that initiate questions or address the reader's curiosity

    Negative Logits
    nown
    -0.71
    ©¶æ¥µ
    -0.67
    imar
    -0.66
     realise
    -0.64
    ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
    -0.64
    pex
    -0.64
    qua
    -0.62
    udeb
    -0.62
    %.
    -0.62
    tu
    -0.61
    Positive Logits
     warr    0.66
     anything    0.64
     dessert    0.63
     exclus    0.62
     inspiration    0.62
     yourself    0.62
     finer    0.60
     throne    0.60
     athlet    0.60
     caffeine    0.58
    Activation Density 0.359%

    No Known Activations