INDEX
    Explanations

    names of people and entities

    Negative Logits
     himself
    -0.15
     Himself
    -0.12
     his
    -0.10
    his
    -0.10
     sám
    -0.09
     seinen
    -0.09
    他的
    -0.08
     خودش
    -0.08
     его
    -0.08
     seiner
    -0.07
    Positive Logits
     alike
    0.20
     respectively
    0.19
     respective
    0.14
     themselves
    0.14
     각각
    0.12
     their
    0.11
    分别
    0.10
     sowie
    0.10
     Their
    0.10
     both
    0.10
    Act Density 0.086%

    No Known Activations