INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Fu
    -0.08
    inf
    -0.06
    .dispatch
    -0.06
    pf
    -0.06
    })↵↵↵
    -0.06
    .transaction
    -0.06
    Hint
    -0.06
     qué
    -0.06
                                                          
    -0.06
    -0.06
    POSITIVE LOGITS
     vertical
    0.15
    vertical
    0.11
     Vertical
    0.11
    Vertical
    0.10
    -vertical
    0.09
     vertically
    0.09
    0.08
    (vertical
    0.08
    _vertical
    0.08
    .vertical
    0.07
    Act Density 0.005%

    No Known Activations