INDEX
    Explanations

    references to academic institutions and educational contexts

    Negative Logits
    Token (quoted to show leading space)    Logit
    " his"                                  -1.61
    "his"                                   -1.49
    " istrinya"                             -1.21
    "His"                                   -1.10
    " seinen"                               -1.07
    "HIS"                                   -1.05
    " seine"                                -1.02
    " 그의"                                 -1.01
    " HIS"                                  -1.00
    " seinem"                               -0.99
    Positive Logits
    Token (quoted to show leading space)    Logit
    " he"                                   2.61
    "He"                                    1.72
    " он"                                   1.66
    " He"                                   1.56
    " він"                                  1.37
    " she"                                  1.19
    "he"                                    1.17
    " הוא"                                  1.13
    " HE"                                   1.07
    " hee"                                  1.02
    Activation Density: 0.616%

    No Known Activations