INDEX
    Explanations

    function definitions and control structures, particularly in programming code

    New Auto-Interp
    Negative Logits
    ochem
    -0.54
     nó
    -0.45
    isma
    -0.43
    an
    -0.42
    atro
    -0.41
    esos
    -0.41
    zkiem
    -0.40
    рованные
    -0.39
     trop
    -0.39
    cionar
    -0.39
    POSITIVE LOGITS
     self
    3.22
    self
    3.15
     Self
    2.42
    Self
    2.31
     SELF
    2.16
     selves
    1.96
    SELF
    1.85
     Selbst
    1.84
     herself
    1.74
     zelf
    1.69
    Act Density 0.054%

    No Known Activations