INDEX
    Explanations

    statements or quotations made by people

    New Auto-Interp
    Negative Logits
    estern
    -0.95
    ammy
    -0.71
    avorite
    -0.71
    peg
    -0.70
    OUP
    -0.67
    transfer
    -0.66
    esc
    -0.66
    ãĤ¼ãĤ¦ãĤ¹
    -0.65
    ynam
    -0.65
    asonic
    -0.65
    POSITIVE LOGITS
     "[
    0.87
     they
    0.85
     it
    0.84
     "...
    0.77
     "(
    0.75
     "'
    0.74
     "â̦
    0.73
     goodbye
    0.72
     instead
    0.70
     otherwise
    0.70
    Act Density 0.078%

    No Known Activations