INDEX
    Explanations

    phrases starting with the word "Wh"

    occurrences of the substring "Wh"

    New Auto-Interp
    Negative Logits
    WARE
    -0.79
    phrine
    -0.79
    uated
    -0.76
    ATIONS
    -0.73
    uating
    -0.70
    Reloaded
    -0.70
    DEN
    -0.70
     Blazers
    -0.68
    steen
    -0.66
    KEN
    -0.65
    POSITIVE LOGITS
    istle
    1.25
    ilst
    1.25
    olly
    1.22
    irlwind
    1.21
    ispers
    1.15
    isky
    1.08
    atson
    1.07
    soever
    1.07
    ither
    1.02
    izz
    1.02
    Act Density 0.011%

    No Known Activations