INDEX
    Explanations

    instances of the word "instead" in various contexts

    Negative Logits
    "enity"      -0.16
    "oshi"       -0.15
    "uckle"      -0.14
    "ogl"        -0.14
    "loo"        -0.14
    " Marino"    -0.14
    "صد"         -0.14
    " already"   -0.14
    "TEGER"      -0.13
    "elin"       -0.13
    Positive Logits
    "instead"    0.17
    "chie"       0.16
    "bes"        0.16
    " instead"   0.15
    "ĶåĽŀ"       0.14
    "رد"         0.14
    "ewe"        0.14
    "Instead"    0.14
    " Instead"   0.14
    "swith"      0.13
    Act Density: 0.019%

    No Known Activations