INDEX
    Explanations

    instances of the word "show" in various contexts

    Negative Logits
    leck    -0.17
    uyen    -0.16
    ned     -0.15
    epad    -0.15
    neck    -0.15
    unset   -0.14
    ượng    -0.14
    алеж    -0.14
    nick    -0.14
    otate   -0.14
    Positive Logits
    alter   0.25
    time    0.24
    biz     0.23
    case    0.22
    cases   0.22
    piece   0.22
    ALTER   0.20
    stop    0.20
    CASE    0.20
     offs   0.19
    Activation Density: 0.022%

    No Known Activations