INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    enius
    -0.70
     Workers
    -0.64
    urence
    -0.47
     Z
    -0.47
     ketua
    -0.46
     W
    -0.46
     ricar
    -0.43
     labor
    -0.42
    mouseup
    -0.41
    imageshack
    -0.40
    POSITIVE LOGITS
    ValueStyle
    0.99
    AndEndTag
    0.88
     MenuView
    0.87
     يتيمه
    0.86
    ########.
    0.85
     CreateTagHelper
    0.84
    ſelf
    0.81
    OGND
    0.80
     Anſ
    0.79
     leaſt
    0.79
    Act Density 0.101%

    No Known Activations