INDEX
    Explanations

    concept that describes or specifies

    New Auto-Interp
    Negative Logits
     ним
    0.49
     них
    0.49
     ها
    0.49
    టువంటి
    0.48
     случае
    0.48
    ИС
    0.48
     имеется
    0.48
    ക്ഷ
    0.47
    っていた
    0.47
     присутствует
    0.47
    POSITIVE LOGITS
     that
    1.54
     which
    1.31
     whose
    1.21
    that
    1.17
     thats
    1.13
     التي
    1.12
     reminiscent
    1.10
     resembling
    1.06
     που
    1.05
    which
    1.05
    Act Density 3.578%

    No Known Activations