    Explanations

    structural elements and organization in technical descriptions

    Negative Logits
        "heads"     -0.15
        "rise"      -0.14
        "elib"      -0.14
        "],[-"      -0.13
        "ÑĪин"      -0.13
        "ze"        -0.13
        "udu"       -0.13
        " Heads"    -0.13
        "oxid"      -0.13
        "izard"     -0.12
    Positive Logits
        " bottom"    1.13
        " Bottom"    0.99
        "bottom"     0.96
        "Bottom"     0.93
        " BOTTOM"    0.86
        "-bottom"    0.85
        "_bottom"    0.81
        " bottoms"   0.79
        ".bottom"    0.75
        "BOTTOM"     0.75
    Activation Density: 0.129%

    No Known Activations