INDEX
    Explanations

    what is and what are questions

    New Auto-Interp
    Negative Logits
    етесь
    0.41
     Whether
    0.39
     whether
    0.38
    ქვთ
    0.38
     více
    0.37
    Whether
    0.36
    ваться
    0.35
     ندار
    0.35
    емся
    0.35
    деги
    0.35
    POSITIVE LOGITS
     constitutes
    0.95
     exactly
    0.80
     constit
    0.77
     defines
    0.77
     distinguishes
    0.76
     constitute
    0.76
     exactement
    0.74
     differentiates
    0.74
    exactly
    0.72
     qualifies
    0.71
    Act Density 0.104%

    No Known Activations