    NEGATIVE LOGITS
    " ensuing"    -0.08
    "_encoded"    -0.08
    "636"         -0.08
    "ਾਨਕ"         -0.07
    " során"      -0.07
    "_ws"         -0.07
    " Alf"        -0.07
    "_encoding"   -0.07
    "_encode"     -0.07
    " 당시"        -0.07
    POSITIVE LOGITS
    " affairs"    0.08
    " đạo"        0.08
    " workable"   0.08
                  0.08
    "Fd"          0.07
    "FY"          0.07
    " brass"      0.07
    " Abe"        0.07
    " eryth"      0.07
    "דו"          0.07
    Activation Density 0.027%

    No Known Activations