INDEX
    Explanations

    curly braces and indentation patterns in code

    New Auto-Interp
    Negative Logits
    ät
    -0.15
    ddl
    -0.15
    ÑĤÑı
    -0.15
    cono
    -0.14
    ãĤĩ
    -0.14
     panc
    -0.14
     Luo
    -0.13
     Chung
    -0.13
     Sink
    -0.13
    osu
    -0.13
    POSITIVE LOGITS
    strup
    0.17
    ovnÃŃ
    0.15
    RIX
    0.15
    amburger
    0.15
    rž
    0.14
    chy
    0.14
     oversh
    0.14
     miscon
    0.14
    ANCH
    0.14
    anch
    0.14
    Act Density 0.000%

    No Known Activations