INDEX
    Explanations

    occurrences of the word "for" and other prepositions indicating purpose or reason

    New Auto-Interp
    Negative Logits
    aks
    -0.17
    ezier
    -0.15
    017
    -0.14
    MOTE
    -0.13
    iram
    -0.13
    åĬ¨çĶŁæĪIJ
    -0.13
    okt
    -0.13
    ìĿ´ìķ¼
    -0.12
    antry
    -0.12
    449
    -0.12
    POSITIVE LOGITS
    zyst
    0.17
    ello
    0.17
    олоÑģ
    0.16
    YPRE
    0.14
    /us
    0.14
    ippers
    0.14
    usercontent
    0.14
    osite
    0.14
    iba
    0.14
    ools
    0.14
    Act Density 0.035%

    No Known Activations