INDEX
    Explanations

    possessive forms of nouns

    New Auto-Interp
    Negative Logits
    arness
    -0.19
    dre
    -0.16
    hin
    -0.15
    ange
    -0.15
    utow
    -0.15
    etting
    -0.15
    nex
    -0.15
    oca
    -0.15
    thalm
    -0.14
    wm
    -0.14
    POSITIVE LOGITS
    wide
    0.16
    lef
    0.15
    Wide
    0.14
    ê°Ħ
    0.14
    iversit
    0.14
     доÑĤ
    0.13
    _own
    0.13
     Wide
    0.13
    -wide
    0.13
     ActiveSupport
    0.13
    Act Density 0.068%

    No Known Activations