INDEX
    Explanations

    the word "for" in various contexts

    Negative Logits
    mite            -0.15
    λει             -0.15
    erta            -0.15
    ような             -0.15
    etting          -0.14
    maker           -0.14
    eno             -0.14
    usercontent     -0.14
    kil             -0.14
     необходимости  -0.13
    Positive Logits
     purposes       0.49
     sake           0.41
     instance       0.37
    -profit         0.34
     reasons        0.34
     example        0.33
    /by             0.32
    bidden          0.32
    aging           0.31
    ays             0.31
    Act Density 0.734%
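
The density figure above is presumably the share of sampled token positions on which the feature fires at all. The page does not state the definition, so the sketch below assumes it: count the positions whose activation exceeds a threshold and divide by the total. The function name `activation_density`, the threshold, and the sample values are all hypothetical and not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumed definition: the source page reports a density but does not
    say how it is computed.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up activations for one feature over 1000 token positions,
# 7 of which are nonzero.
sample = [0.0] * 993 + [1.2, 0.4, 2.1, 0.9, 0.3, 1.7, 0.6]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.734% would mean the feature is active on roughly 7 of every 1000 sampled tokens.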

    No Known Activations