INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ild
    -0.07
    Already
    -0.07
     рівня
    -0.07
    statement
    -0.06
     EVEN
    -0.06
     g
    -0.06
    tha
    -0.06
    -even
    -0.06
    ота
    -0.06
    iya
    -0.06
    POSITIVE LOGITS
     ancestral
    0.30
    stral
    0.18
    .connector
    0.09
    Master
    0.07
     قي
    0.07
    	instance
    0.07
    _Native
    0.06
    .Pixel
    0.06
     neurological
    0.06
     anmeld
    0.06
    Act Density 0.000%

    No Known Activations