    Negative Logits
    Token    Value
    "ס"      1.41
    "ك"      1.28
    "ను"     1.26
    "ت"      1.18
    "el"     1.16
    "ל"      1.16
    "ص"      1.15
    "מ"      1.13
    "ри"     1.13
    "ا"      1.11
    Positive Logits
    Token            Value
    "o"              1.00
    "s"              0.99
    "q"              0.89
    "r"              0.89
    (not shown)      0.84
    (not shown)      0.83
    "e"              0.82
    "u"              0.79
    " prestación"    0.79
    " функ"          0.78
    Activation Density: 0.000%

    No Known Activations