INDEX
    Explanations

    occurrences of various verbs and adverbs that indicate actions or states relating to necessity and intention

    New Auto-Interp
    Negative Logits
    yat
    -0.16
    ç´Ģ
    -0.16
    eph
    -0.15
    ynes
    -0.15
    yle
    -0.15
    lings
    -0.15
    arget
    -0.14
    just
    -0.14
     Cons
    -0.14
     Norm
    -0.14
    POSITIVE LOGITS
    idlo
    0.18
     CHIP
    0.17
    CHIP
    0.16
    qed
    0.16
    .scalablytyped
    0.15
    iband
    0.15
    OTOS
    0.15
    idth
    0.15
     Retrie
    0.15
    ÑĤон
    0.15
    Act Density 0.002%

    No Known Activations