INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    (Index
    -0.07
    Fail
    -0.06
     Coul
    -0.06
    _FILL
    -0.06
     نمود
    -0.06
    .receiver
    -0.06
    Plant
    -0.06
    Prefix
    -0.06
     zel
    -0.06
    699
    -0.06
    POSITIVE LOGITS
    accepted
    0.07
    anny
    0.07
     internship
    0.07
     mlad
    0.07
     recru
    0.07
     civ
    0.07
    терн
    0.07
    (an
    0.07
     dumping
    0.07
     Dayton
    0.07
    Act Density 0.005%

    No Known Activations