INDEX
    NEGATIVE LOGITS
    "🐮"  0.42
    " AdWords"  0.41
    "😗"  0.41
    "LOUISE"  0.41
    " nanoparticle"  0.40
    "🍴"  0.39
    " появилась"  0.39
    " Errors"  0.39
    [token not rendered]  0.38
    "Чтобы"  0.38
    POSITIVE LOGITS
    " NR"  0.50
    " streams"  0.43
    "🫶"  0.41
    "ebu"  0.40
    " metaverse"  0.38
    " quantum"  0.38
    " promoters"  0.38
    " skis"  0.38
    "XR"  0.38
    "NR"  0.37
    Activation Density: 0.009%

    No Known Activations