INDEX
    Explanations
    Negative Logits
    Token           Logit
    "4"             -0.07
    " прой"         -0.07
    "26"            -0.07
    "04"            -0.07
    "6"             -0.07
    " Diablo"       -0.07
    "Star"          -0.07
    "-fly"          -0.07
    " developer"    -0.07
    "or"            -0.07
    Positive Logits
    Token           Logit
    " sentence"     0.10
    "_sentence"     0.09
    "(sentence"     0.08
    "sentence"      0.08
    "case"          0.08
    " sentences"    0.08
    " Sentence"     0.08
    "_sentences"    0.07
    " sudden"       0.07
    "είται"         0.07
    Activation Density: 0.026%

    No Known Activations