INDEX
    Explanations

    linking verbs or identifying phrases

    Negative Logits
    Token              Logit
    "Its"              0.88
    " которое"         0.85
    " Its"             0.76
    " którego"         0.75
    " একটি"            0.75
    " sebuah"          0.67
    " itself"          0.66
    "一个"             0.66
    " kuris"           0.65
    " яке"             0.64

    Positive Logits
    Token              Logit
    " themselves"      1.46
    " რომლებიც"        1.20
    " considerados"    1.09
    "也都"             1.07
    " ඒවා"             1.07
    " जिनमें"           1.05
    " كلهم"            1.05
    "mselves"          1.04
    " आहेत"            1.02
    " những"           1.02

    Activation Density 0.444%

    No Known Activations