INDEX
    NEGATIVE LOGITS
    Token           Logit
    " rant"         -0.08
    " slip"         -0.07
    " cardboard"    -0.07
    "ندر"           -0.07
    "ff"            -0.07
    "_tt"           -0.07
    " hooking"      -0.07
    " undue"        -0.06
    ".webp"         -0.06
    " tram"         -0.06
    POSITIVE LOGITS
    Token           Logit
    "abolism"       0.08
    " Catholics"    0.08
    " ampla"        0.07
    " Atlantis"     0.07
    " acre"         0.07
    " Kate"         0.07
    " Oxford"       0.07
    ".SP"           0.07
    " Agar"         0.07
    " Ceramic"      0.07
    Activation Density: 0.002%

    No Known Activations