INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    تك
    0.85
    6
    0.80
    ானா
    0.79
    8
    0.79
    9
    0.74
    ات
    0.72
    าย
    0.71
    д
    0.70
    סה
    0.68
    5
    0.67
    POSITIVE LOGITS
    i
    0.84
    0.75
    ;
    0.71
     Anglo
    0.68
    0.64
    o
    0.62
            
    0.61
    <0x0D>
    0.61
     If
    0.60
       
    0.60
    Act Density 0.003%

    No Known Activations