    Negative Logits
    -0.07
     inadvertently       -0.07
    .setBackgroundColor  -0.06
     Evrop               -0.06
    ่งข                  -0.06
     найбільш            -0.06
     khảo                -0.06
    .fixture             -0.06
    _servers             -0.06
    iltro                -0.06
    Positive Logits
    warf      0.07
    Catch     0.07
     TRUE     0.06
    effect    0.06
    ARENT     0.06
     PROC     0.06
    INLINE    0.06
    Beh       0.06
     derin    0.06
     english  0.06
    Activation Density: 0.011%

    No Known Activations