INDEX
    Explanations

    phrases and variations of the word "what."

    New Auto-Interp
    Negative Logits
     Aze
    -0.71
     Verge
    -0.68
     reminder
    -0.68
     Brea
    -0.67
     Moors
    -0.67
    erl
    -0.64
    es
    -0.63
    EES
    -0.62
     Jolie
    -0.62
     Crocodile
    -0.61
    POSITIVE LOGITS
     what
    2.27
    what
    2.11
     WHAT
    2.05
    WHAT
    1.98
    What
    1.90
     What
    1.86
    whats
    1.30
     whats
    1.17
    Whats
    1.09
     hvad
    1.09
    Act Density 0.100%

    No Known Activations