INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ו
    1.11
    ל
    0.93
    ة
    0.90
     la
    0.84
    <0x0D>
    0.83
    то
    0.82
    й
    0.82
    د
    0.82
    ת
    0.82
    0.81
    POSITIVE LOGITS
     اخرى
    0.80
    Hasil
    0.76
    un
    0.74
    fe
    0.72
    ভাবে
    0.71
     Fe
    0.70
    Fe
    0.70
     anderer
    0.66
    readable
    0.66
    Altri
    0.64
    Act Density 0.000%

    No Known Activations