INDEX
    Explanations

    phrases related to statements or verbal communication

    instances of the verb "say" and its variations

    New Auto-Interp
    Negative Logits
    ngth
    -0.80
    ãĤ¼
    -0.78
    ãĤ´ãĥ³
    -0.64
    Ĥª
    -0.64
     Nanto
    -0.64
    Ö¼
    -0.63
    âĹ¼
    -0.61
    ptic
    -0.60
    ãĥ¯
    -0.59
    Tur
    -0.57
    POSITIVE LOGITS
     definitively
    1.31
     anything
    1.30
     whether
    1.22
     aloud
    1.06
     unequivocally
    1.06
    anything
    1.04
     exactly
    1.03
     goodbye
    1.00
     explicitly
    0.99
     why
    0.96
    Act Density 0.052%

    No Known Activations