INDEX
    Explanations

    the preposition "for" in various contexts

    New Auto-Interp
    Negative Logits
    lest
    -0.17
    ali
    -0.16
    le
    -0.16
    hle
    -0.15
    bury
    -0.14
     ru
    -0.14
    Pel
    -0.14
     mini
    -0.14
    ä¹Ļ
    -0.14
    ivar
    -0.13
    POSITIVE LOGITS
    arcer
    0.19
    ileÅŁ
    0.17
    ози
    0.16
    Ñİк
    0.15
    ewis
    0.15
    unde
    0.15
    iente
    0.15
    icker
    0.14
    reme
    0.14
    DataStream
    0.14
    Act Density 0.010%

    No Known Activations