INDEX
    Explanations

    the word "for" in various contexts

    New Auto-Interp
    Negative Logits
    439
    -0.15
     Gors
    -0.15
    57
    -0.15
    39
    -0.15
     ru
    -0.14
     process
    -0.14
    65
    -0.14
     Kavanaugh
    -0.14
    xit
    -0.14
     CType
    -0.14
    POSITIVE LOGITS
    acer
    0.18
    unately
    0.16
    kees
    0.16
    aml
    0.15
    ahl
    0.15
    aland
    0.15
    earn
    0.15
    bak
    0.15
    मà¤ķ
    0.15
    kiye
    0.14
    Act Density 0.511%

    No Known Activations