INDEX
    Explanations

    any form of the word "wear" or related terms

    New Auto-Interp
    Negative Logits
    ri
    -0.18
    res
    -0.18
    v
    -0.18
    sel
    -0.17
    ref
    -0.16
    re
    -0.16
    sh
    -0.16
    ritt
    -0.16
    ae
    -0.16
    si
    -0.16
    POSITIVE LOGITS
    preneur
    0.21
    ments
    0.21
    chts
    0.20
    xit
    0.19
    ddie
    0.19
    Ìģ
    0.19
    deriv
    0.19
    trie
    0.19
    ngth
    0.19
    ngthen
    0.18
    Act Density 0.044%

    No Known Activations