INDEX
    Negative Logits
    Hochspringen       -0.70
     <=",              -0.58
     createState       -0.56
    so                 -0.55
    findpost           -0.53
    tagHelperRunner    -0.51
     propOrder         -0.50
     []:               -0.49
    XmlAccessorType    -0.49
    Przypisy           -0.49
    Positive Logits
     Hanno             0.64
    σθαι               0.64
    daction            0.59
     Marais            0.58
     StringTokenizer   0.56
     副本              0.56
    dehyde             0.54
     EnglishChoose     0.54
    NUMX               0.54
    ároz               0.54
    Act Density 0.057%

    No Known Activations