INDEX
    Explanations

    the word "or" used in various contexts

    New Auto-Interp
    Negative Logits
    Äįil
    -0.16
    ilor
    -0.16
    505
    -0.15
    iled
    -0.15
    ils
    -0.15
    ãĥ¥ãĥ¼
    -0.14
    ãĤĨ
    -0.14
    imized
    -0.14
    ilt
    -0.14
    notated
    -0.14
    POSITIVE LOGITS
     two
    0.31
    two
    0.24
     couple
    0.23
    -two
    0.21
    两个
    0.20
     deux
    0.20
     zwei
    0.20
     few
    0.20
    两
    0.19
    åħ©
    0.19
    Act Density 0.022%

    No Known Activations