    Explanations

    instances of the word "instead."

    Negative Logits
    Token         Logit
     Cahill       -0.75
    StatusOK      -0.75
     scolaires    -0.70
     "));         -0.69
    Oise          -0.68
    er            -0.67
     oči          -0.65
     fasi         -0.64
    arque         -0.64
     nourrir      -0.64
    Positive Logits
    Token         Logit
     Instead      2.20
     instead      2.19
    Instead       2.15
    instead       2.08
     Rather       1.36
    Rather        1.30
     rather       1.28
     istället     1.23
     вместо       1.21
    uttosto       1.18
    Activation Density: 0.167%

    No Known Activations