INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Sit
    -0.07
    olvimento
    -0.07
     struck
    -0.06
    reeting
    -0.06
     очевид
    -0.06
     مثلا
    -0.06
    agens
    -0.06
     Rachel
    -0.06
     biggest
    -0.06
     cestu
    -0.06
    POSITIVE LOGITS
     ViewBag
    0.07
    .translation
    0.07
     Nur
    0.07
     κορ
    0.06
    .variables
    0.06
     vypad
    0.06
     Bowen
    0.06
     supers
    0.06
    จร
    0.06
    0.06
    Act Density 0.009%

    No Known Activations