INDEX
    Explanations

    possessive pronouns

    New Auto-Interp
    Negative Logits
    /FM
    -0.08
    :UI
    -0.08
    Rotate
    -0.08
     brochure
    -0.08
    -Shirt
    -0.08
     übers
    -0.08
    .Rotate
    -0.08
    이어
    -0.07
    _DECL
    -0.07
    inthe
    -0.07
    POSITIVE LOGITS
     neighbors
    0.09
     partners
    0.09
     colleagues
    0.09
     coworkers
    0.09
     ಮಕ್ಕ
    0.09
     Dad
    0.09
    0.09
     mentors
    0.09
    0.09
     children
    0.09
    Act Density 0.110%

    No Known Activations