INDEX
    Explanations

    possessive pronouns

    New Auto-Interp
    Negative Logits
    Gender
    -0.07
     besch
    -0.07
     '.
    -0.07
    	P
    -0.07
     minerals
    -0.06
    integral
    -0.06
    Problem
    -0.06
    talk
    -0.06
    -wh
    -0.06
    "M
    -0.06
    POSITIVE LOGITS
     Dominic
    0.06
     процес
    0.06
     reproduce
    0.06
    xlsx
    0.06
    itorio
    0.06
    erosis
    0.06
    아요
    0.06
     расч
    0.06
     metaData
    0.06
     pione
    0.06
    Act Density 0.102%

    No Known Activations