INDEX
    Explanations

    phrases that emphasize the concept of "for" indicating purpose or function

    New Auto-Interp
    Negative Logits
    ing
    -0.16
    606
    -0.15
    ING
    -0.15
    ll
    -0.15
    illez
    -0.15
    sgiving
    -0.15
    abh
    -0.14
    ariance
    -0.14
     me
    -0.13
    stm
    -0.13
    POSITIVE LOGITS
    amed
    0.17
     Beste
    0.16
    isz
    0.15
    .scalablytyped
    0.14
    ī
    0.14
    aniel
    0.13
     Admir
    0.13
    oenix
    0.13
    eya
    0.13
    agos
    0.13
    Act Density 0.052%

    No Known Activations