INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     AFTER
    -0.09
     luka
    -0.08
     ABOVE
    -0.08
    abouts
    -0.08
    תם
    -0.08
     Cpu
    -0.08
     MADE
    -0.08
    över
    -0.08
    zell
    -0.08
     FEL
    -0.08
    POSITIVE LOGITS
    Clar
    0.08
     يُ
    0.08
    Firstly
    0.08
    _local
    0.07
    Builder
    0.07
    Implement
    0.07
    -ব
    0.07
    References
    0.07
    _admin
    0.07
    _s
    0.07
    Act Density 0.630%

    No Known Activations