INDEX
    Explanations

    words related to various forms of possession and ownership

    New Auto-Interp
    Negative Logits
    å¹
    -0.17
    pering
    -0.14
    lier
    -0.14
     Props
    -0.14
     Discipline
    -0.14
    ieres
    -0.13
    itta
    -0.13
     Hal
    -0.13
    ãģ
    -0.13
     Injection
    -0.13
    POSITIVE LOGITS
    cki
    0.16
    coe
    0.16
    ãĤ¤ãĥĦ
    0.15
    ниÑĩ
    0.15
    dsa
    0.15
    jsc
    0.14
    деÑĢ
    0.14
    zes
    0.14
    öh
    0.14
    ihn
    0.14
    Act Density 0.255%

    No Known Activations