INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ΕΛ
    -0.07
     سازمان
    -0.07
     sal
    -0.07
     Gül
    -0.06
     Laguna
    -0.06
     flowing
    -0.06
    ($('#
    -0.06
     libro
    -0.06
    ipated
    -0.06
    NamedQuery
    -0.06
    POSITIVE LOGITS
     print
    0.12
     Print
    0.10
    Print
    0.10
     Prints
    0.08
     prints
    0.08
     :=↵
    0.07
    "url
    0.07
    ]}↵
    0.07
     printed
    0.07
    ічна
    0.06
    Act Density 0.005%

    No Known Activations