INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     gamble
    -0.07
     پا
    -0.07
     уров
    -0.06
     Mojo
    -0.06
    ัญห
    -0.06
     yararlan
    -0.06
    've
    -0.06
    	SP
    -0.06
     Vanguard
    -0.06
     využití
    -0.06
    POSITIVE LOGITS
     extremely
    0.10
    cia
    0.07
     examiner
    0.07
     excell
    0.07
     Imm
    0.07
     Motor
    0.07
     completely
    0.07
    kins
    0.06
     strictly
    0.06
    ……
    0.06
    Act Density 0.034%

    No Known Activations