INDEX
    Explanations

    words related to external qualities and actions

    New Auto-Interp
    Negative Logits
    лин
    -0.17
    lez
    -0.16
    fulness
    -0.16
    ÑĢиÑĦ
    -0.15
    aptor
    -0.15
     Nay
    -0.15
     Omn
    -0.15
    ql
    -0.15
    azzi
    -0.15
    INAL
    -0.14
    POSITIVE LOGITS
    ensive
    0.31
    remely
    0.30
    inction
    0.29
    /ext
    0.26
    rem
    0.26
     ext
    0.25
    (ext
    0.24
    ending
    0.24
    ender
    0.23
    ention
    0.23
    Act Density 0.012%

    No Known Activations