    Explanations

    Auxiliary verbs

    Negative Logits
    Token             Logit
    (not shown)       -0.07
    (not shown)       -0.07
    " тебе"           -0.07
    (not shown)       -0.07
    " sublicense"     -0.07
    "nodes"           -0.07
    " emiss"          -0.07
    " shining"        -0.07
    "mamak"           -0.06
    "قال"             -0.06

    Positive Logits
    Token             Logit
    "ieves"           0.06
    "ureau"           0.06
    " Musk"           0.06
    "_ROUND"          0.06
    "ادی"             0.06
    " Passenger"      0.06
    " narrowly"       0.05
    ".coordinate"     0.05
    " listop"         0.05
    "\\\"             0.05

    Activation Density: 0.206%

    No Known Activations