INDEX
    Explanations

    possessive pronouns and references describing ownership or relationship

    Negative Logits
    s
    -0.16
    äll
    -0.15
     Guy
    -0.15
     Conway
    -0.14
    ustum
    -0.14
     Fashion
    -0.14
    Guy
    -0.14
    sí
    -0.14
     Deck
    -0.13
     Classe
    -0.13
    Positive Logits
    itsu
    0.15
    å´
    0.15
    特色
    0.15
    elt
    0.14
    ittings
    0.14
    スレ
    0.14
    asmus
    0.14
    々
    0.14
    lef
    0.14
    忍
    0.14
    Activation Density 0.211%

    No Known Activations