INDEX
    Explanations

    phrases indicating a statement or opinion on a topic

    phrases that express a statement or assertion

    New Auto-Interp
    Negative Logits
    xtap
    -0.85
    ¥ŀ
    -0.76
    hooting
    -0.75
    asu
    -0.74
    pes
    -0.72
    phrine
    -0.72
    mental
    -0.71
     conflic
    -0.71
    tele
    -0.70
    Tai
    -0.69
    POSITIVE LOGITS
     goodbye
    1.38
     hello
    1.03
     aloud
    0.89
     Goodbye
    0.87
     farewell
    0.78
    ysis
    0.74
    ieu
    0.71
     definitively
    0.69
     sorry
    0.68
    ings
    0.64
    Act Density 0.049%

    No Known Activations