INDEX
    Explanations

    repetitive patterns and mathematical expressions in code

    Negative Logits
    Token            Logit
    "AndEndTag"      -0.75
    "two"            -0.63
    "Two"            -0.60
    " OMITBAD"       -0.59
    " two"           -0.58
    " Two"           -0.57
    "TWO"            -0.57
    " deux"          -0.55
    "rungsseite"     -0.54
    "httphttps"      -0.54
    Positive Logits
    Token            Logit
    "Spoilers"       0.43
    " potr"          0.38
    " miniaturka"    0.36
    " Personalis"    0.34
    "SPOILERS"       0.33
    " mijne"         0.32
    " capilla"       0.32
    "ieť"            0.32
    "gotta"          0.32
    " techniczne"    0.31
    Act Density: 0.696%

    No Known Activations