INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     مرض
    -0.09
     univer
    -0.08
     grot
    -0.08
     Universiteit
    -0.08
     reprodução
    -0.08
     semin
    -0.08
     monopoly
    -0.08
     scammers
    -0.07
     Universidade
    -0.07
    krut
    -0.07
    POSITIVE LOGITS
     орналас
    0.10
     располож
    0.10
     placement
    0.10
     расположен
    0.10
     위치
    0.10
    Placement
    0.10
     positioned
    0.09
     байр
    0.09
    .position
    0.09
     Placement
    0.09
    Act Density 0.001%

    No Known Activations