INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Nir
    -0.07
     ejected
    -0.07
     flowering
    -0.06
     influence
    -0.06
    On
    -0.06
     Thus
    -0.06
     ICE
    -0.06
     труда
    -0.06
    .program
    -0.06
     bilingual
    -0.06
    POSITIVE LOGITS
     desperate
    0.16
     desperately
    0.13
     desperation
    0.12
     urgency
    0.07
     crackdown
    0.07
    dpi
    0.07
     dying
    0.07
    .RESULT
    0.07
    stack
    0.07
     urgent
    0.07
    Act Density 0.004%

    No Known Activations