INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     antagonist
    -0.06
     rationale
    -0.06
    уществ
    -0.06
     متعدد
    -0.06
     Parad
    -0.06
     WAN
    -0.05
     aktu
    -0.05
    _frac
    -0.05
     ให
    -0.05
     indicator
    -0.05
    POSITIVE LOGITS
     sleep
    0.12
     Sleep
    0.11
    Sleep
    0.10
     sleeping
    0.09
    LEEP
    0.08
     sleeper
    0.08
    sleep
    0.08
     asleep
    0.07
     som
    0.07
     QVBoxLayout
    0.07
    Act Density 0.015%

    No Known Activations