INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     dissoci
    -0.08
     requisite
    -0.08
     tabs
    -0.07
     Gian
    -0.07
     جي
    -0.07
     bas
    -0.07
     أج
    -0.07
     barring
    -0.07
     CNS
    -0.07
     가진
    -0.07
    POSITIVE LOGITS
     Newton
    0.09
    Newton
    0.08
    Colors
    0.08
    0.07
     Charlotte
    0.06
    Oregon
    0.06
     intval
    0.06
    INNER
    0.06
    EXPR
    0.06
     stores
    0.06
    Act Density 0.024%

    No Known Activations