    Explanations

    instruction and requirement

    Negative Logits
    nh              1.14
     Useful         1.13
    полез           1.06
    פור             1.05
     надеюсь        0.95
    kep             0.95
     uncontroll     0.94
    нде             0.94
    etest           0.93
    Newline         0.93
    Positive Logits
     dringend       1.48
     reminding      1.20
    用到            1.19
     arises         1.18
     arise          1.17
     urgently       1.17
    iness           1.17
     asap           1.16
     urgente        1.15
    iest            1.13
    Activation Density: 0.135%

    No Known Activations