    Negative Logits
    Token          Logit
    "special"      -0.37
    "setattr"      -0.36
    "get"          -0.36
    "s"            -0.35
    "fB"           -0.34
    "unfinished"   -0.34
    " Ful"         -0.33
    "working"      -0.33
    "inc"          -0.33
    " …”"          -0.32
    Positive Logits
    Token          Logit
    " Khan"        2.55
    "Khan"         2.39
    " khan"        2.05
    "khan"         1.59
    " Kahn"        1.52
    " خان"         1.27
    " Хан"         0.97
    "خان"          0.84
    " glyphicon"   0.80
    "Xna"          0.78
    Activation Density: 0.003%

    No Known Activations