    Negative Logits
    Token           Logit
    " مطالعه"       -0.07
    " launcher"     -0.07
    "_PULL"         -0.06
    " bus"          -0.06
    "Extensions"    -0.06
    " Water"        -0.06
    " Computing"    -0.06
    " gratuit"      -0.06
    "_direct"       -0.06
    " Degrees"      -0.06
    Positive Logits
    Token                 Logit
    (whitespace run)      0.07
    "(buf"                0.07
    " continually"        0.07
    "ropy"                0.06
    (token not shown)     0.06
    "(CH"                 0.06
    (whitespace run)      0.06
    " لی"                 0.06
    "JS"                  0.06
    "\tjs"                0.06
    Act Density 0.014%

    No Known Activations