    Negative Logits
    (no token text in source)   -0.07
    "ační"                      -0.06
    " semanas"                  -0.06
    "stones"                    -0.06
    " cooper"                   -0.06
    "(row"                      -0.06
    " wrapped"                  -0.06
    "ості"                      -0.06
    "iban"                      -0.06
    " oček"                     -0.06
    POSITIVE LOGITS
    " Natural"                  0.07
    "(Encoding"                 0.07
    "725"                       0.07
    "equalTo"                   0.06
    " Funding"                  0.06
    "NewItem"                   0.06
    ">'."                       0.06
    (no token text in source)   0.06
    ".Execution"                0.06
    ")}}"                       0.06
    Act Density 0.018%

    No Known Activations