INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     TG
    -0.08
     Typed
    -0.07
     Tele
    -0.07
     Sig
    -0.07
     Eur
    -0.07
    .equalsIgnoreCase
    -0.07
     cases
    -0.07
    .Graph
    -0.06
     Reads
    -0.06
     borderTop
    -0.06
    POSITIVE LOGITS
    0.06
    rites
    0.06
     tendencies
    0.06
     newUser
    0.06
     suas
    0.06
    products
    0.06
     paar
    0.06
     odio
    0.06
    יבל
    0.06
    socket
    0.06
    Act Density 0.001%

    No Known Activations