INDEX
    Explanations

    syntactical structures and programming language elements

    New Auto-Interp
    Negative Logits
    ÙĪØ§ÙĨ
    -0.14
     overlaps
    -0.14
     Zhang
    -0.14
    yme
    -0.14
    haft
    -0.14
    uben
    -0.14
     Abrams
    -0.13
    413
    -0.13
    ires
    -0.13
    ina
    -0.13
    POSITIVE LOGITS
     comment
    0.44
     comments
    0.40
     Comment
    0.40
     Komment
    0.35
    comment
    0.35
    Comment
    0.34
     Comments
    0.34
    -comment
    0.34
     commented
    0.34
     komment
    0.33
    Act Density 0.154%

    No Known Activations