INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ながら
    0.41
    گیز
    0.38
     সম্ভবত
    0.37
    之所以
    0.37
     zwar
    0.37
    بیه
    0.36
    addObject
    0.36
    0.36
     Plenty
    0.35
     Possibly
    0.34
    POSITIVE LOGITS
     slightest
    1.98
    哪怕
    1.66
     even
    1.52
     tini
    1.49
     даже
    1.46
    even
    1.37
     smallest
    1.34
     حتی
    1.30
     навіть
    1.30
     EVEN
    1.25
    Act Density 0.038%

    No Known Activations