INDEX
    Explanations
    Negative Logits
    elang           -0.08
     interruption   -0.07
     calmly         -0.07
     Кор            -0.07
     unic           -0.07
    uchen           -0.07
     Neue           -0.07
     Nearby         -0.07
    uthe            -0.07
     Neutral        -0.07
    Positive Logits
     باید           0.08
     talento        0.08
     talents        0.08
    malıdır         0.08
                    0.08
    antwoord        0.08
    hasilan         0.07
    انی             0.07
    രണം             0.07
    579             0.07
    Activation Density: 0.022%

    No Known Activations