INDEX
    Explanations

    phrases related to personal ownership and customization

    New Auto-Interp
    Negative Logits
    dle
    -0.16
    ryn
    -0.15
    ingly
    -0.14
    endo
    -0.14
    dad
    -0.14
    ailable
    -0.14
    βά
    -0.14
    225
    -0.13
    enas
    -0.13
    /react
    -0.13
    POSITIVE LOGITS
     own
    0.32
     Own
    0.21
    Own
    0.21
     respective
    0.20
     próp
    0.19
     eigenen
    0.18
    à¹Ģà¸Ńà¸ĩ
    0.17
     OWN
    0.17
     propia
    0.16
    own
    0.16
    Act Density 0.026%

    No Known Activations