INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     soir
    -0.07
     cake
    -0.06
     begr
    -0.06
    									 
    -0.06
    ками
    -0.06
    -0.06
    CRT
    -0.06
    *.
    -0.06
    VIP
    -0.06
    за
    -0.06
    POSITIVE LOGITS
    supported
    0.07
     manager
    0.06
     igen
    0.06
    ondrous
    0.06
    !(:
    0.06
    ibi
    0.06
    .Pull
    0.06
    Việc
    0.06
    yu
    0.06
     parallel
    0.06
    Act Density 0.004%

    No Known Activations