INDEX
    Explanations

    nouns and verbs that relate to actions and characteristics

    New Auto-Interp
    Negative Logits
    itar
    -0.15
    æķ·
    -0.14
    å®ħ
    -0.14
    toolbox
    -0.14
    edy
    -0.13
    ussed
    -0.13
    anic
    -0.13
    ischen
    -0.13
     Unary
    -0.13
    oho
    -0.13
    POSITIVE LOGITS
    bote
    0.18
    ~/
    0.16
    readcr
    0.14
    DMI
    0.14
    ãĥ¼ãĥł
    0.13
    ø
    0.13
    _owned
    0.13
    ováno
    0.13
    rame
    0.13
    _Tis
    0.13
    Act Density 0.072%

    No Known Activations