INDEX
    Explanations

    phrases related to ownership or possession

    New Auto-Interp
    Negative Logits
    inia
    -0.16
    arts
    -0.15
    olf
    -0.15
    SKI
    -0.14
    igli
    -0.14
    нÑĮо
    -0.14
    spb
    -0.14
    oglob
    -0.14
     âĵĺ
    -0.13
    ÃŁe
    -0.13
    POSITIVE LOGITS
    asco
    0.16
     Bout
    0.15
    azer
    0.14
     Hum
    0.14
    Hum
    0.14
    ahas
    0.14
     induction
    0.13
    WXYZ
    0.13
    /is
    0.13
    ynes
    0.13
    Act Density 0.027%

    No Known Activations