INDEX
    Explanations

    html tags and structure

    Negative Logits
    Token       Value
    "ة"         0.89
    "ي"         0.80
    "ان"        0.71
    "ла"        0.67
    "an"        0.67
    "ینده"      0.66
    "ת"         0.66
    "ק"         0.65
    "ing"       0.64
    "יות"       0.62
    Positive Logits
    Token       Value
    " I"        0.84
    " n"        0.65
    " "         0.62
    " U"        0.59
    " R"        0.57
    "RAM"       0.57
    "_{"        0.56
    " K"        0.55
    (token not recovered)  0.55
    " N"        0.55
    Activation Density: 0.003%

    No Known Activations