INDEX
    Explanations

    discussions on variants and their effectiveness in various contexts

    New Auto-Interp
    Negative Logits
    contri
    -0.16
     Gest
    -0.15
    rung
    -0.14
    653
    -0.14
    ìĦłê±°
    -0.14
     Poz
    -0.14
    ÙħÙĬ
    -0.14
    .Formatter
    -0.14
     Lei
    -0.14
    hoa
    -0.13
    POSITIVE LOGITS
     adequate
    0.34
     suffice
    0.30
     satisfactory
    0.30
     sufficient
    0.29
     adequ
    0.29
     suff
    0.27
     Ade
    0.26
    ade
    0.26
     acceptable
    0.25
    è¶³
    0.24
    Act Density 0.254%

    No Known Activations