INDEX
    Explanations

    words indicating possession or ownership

    Negative Logits
    gee           -0.15
    lac           -0.14
     BOX          -0.14
     Hà           -0.14
    859           -0.14
    aná           -0.14
     Fountain     -0.13
     Hob          -0.13
     Box          -0.13
    wing          -0.13
    Positive Logits
    comma         0.16
    .localized    0.16
    erken         0.15
    ービ            0.15
    itler         0.15
     Verde        0.14
    hap           0.14
    aland         0.14
     рос          0.14
    stantiate     0.14
    Activation Density: 0.015%

    No Known Activations