INDEX
    Explanations

    instances of conditional phrases or hypothetical scenarios

    New Auto-Interp
    Negative Logits
    erts
    -0.06
    olit
    -0.06
    dle
    -0.06
    achts
    -0.06
    acht
    -0.06
    mina
    -0.06
     Hust
    -0.06
    šov
    -0.06
    illet
    -0.06
    oldt
    -0.05
    POSITIVE LOGITS
    OKIE
    0.07
    ocz
    0.07
    ometr
    0.07
    osy
    0.06
     Cotton
    0.06
    иÑģлов
    0.06
    á»įng
    0.06
     vÃŃde
    0.06
    ahren
    0.06
     Stick
    0.06
    Act Density 0.002%

    No Known Activations