INDEX
    Explanations

    instances of the word "instead"

    New Auto-Interp
    Negative Logits
     "));
    -0.93
    "));
    
    -0.85
    AxisAlignment
    -0.82
    *}$
    -0.79
    Oise
    -0.78
     SAK
    -0.77
     
    -0.77
    Portail
    -0.76
    StatusOK
    -0.75
    er
    -0.75
    POSITIVE LOGITS
     Instead
    1.08
    Instead
    1.01
     instead
    0.96
    instead
    0.95
    uttosto
    0.88
    katapos
    0.82
     Rather
    0.73
    enseits
    0.70
     SUBST
    0.68
     Oltre
    0.67
    Act Density 0.151%

    No Known Activations