INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    572
    -0.07
    Дата
    -0.07
     aria
    -0.06
     verbs
    -0.06
    Digits
    -0.06
     Đào
    -0.06
     пропози
    -0.06
    ика
    -0.06
    	filename
    -0.06
     embroid
    -0.06
    POSITIVE LOGITS
     hopefully
    0.10
     Hopefully
    0.10
    Hopefully
    0.09
     surely
    0.07
    hopefully
    0.07
    _KIND
    0.07
    _Callback
    0.07
    REF
    0.07
     hoping
    0.06
    сыл
    0.06
    Act Density 0.005%

    No Known Activations