INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     resilience
    -0.07
     cylinder
    -0.06
    println
    -0.06
     yerel
    -0.06
     dwar
    -0.06
    ρή
    -0.06
     تماس
    -0.06
    Increasing
    -0.06
    .DateTimeField
    -0.06
    iddles
    -0.06
    POSITIVE LOGITS
     Permission
    0.10
    permission
    0.07
    .Permission
    0.07
     عکس
    0.07
    <::
    0.06
     cz
    0.06
     debated
    0.06
    istic
    0.06
     PERMISSION
    0.06
     detalle
    0.06
    Act Density 0.001%

    No Known Activations