INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     going
    -1.97
    going
    -1.70
    Going
    -1.65
     Going
    -1.65
     go
    -1.58
     goes
    -1.51
     GOING
    -1.49
    goes
    -1.34
     goin
    -1.34
    GOING
    -1.33
    POSITIVE LOGITS
     into
    0.75
     to
    0.68
    yntaxException
    0.67
     ffilmiau
    0.65
    WriteAttribute
    0.63
     forward
    0.63
     onto
    0.63
    Rujuakan
    0.62
    AutoScaleMode
    0.61
    verwijspagina
    0.61
    Act Density 0.072%

    No Known Activations