INDEX
    Explanations

    actions related to checking or verifying something

    occurrences of the word "check" and its variations

    New Auto-Interp
    Negative Logits
    ufact
    -0.71
    ña
    -0.67
     Hots
    -0.66
    nown
    -0.64
    joice
    -0.64
    SAY
    -0.63
    nect
    -0.62
    usable
    -0.62
    åħī
    -0.62
    asus
    -0.62
    POSITIVE LOGITS
    mate
    1.02
    lists
    0.91
    boxes
    0.89
     whether
    0.83
     out
    0.75
     boxes
    0.74
    points
    0.70
    ysis
    0.70
     balances
    0.69
    box
    0.68
    Act Density 0.033%

    No Known Activations