INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ld
    0.39
     lage
    0.38
     quoted
    0.38
     quote
    0.38
     Tun
    0.37
    ldigt
    0.37
     Kir
    0.37
     Heritage
    0.37
     Hum
    0.36
     Taman
    0.36
    POSITIVE LOGITS
    👜
    0.41
     recours
    0.41
    InterfaceLine
    0.38
     работой
    0.38
     ফ্লাই
    0.37
    сматри
    0.36
    0.36
    0.36
    жение
    0.36
     прежде
    0.36
    Act Density 0.001%

    No Known Activations