INDEX
    Explanations

    direct quotations in text

    quotation marks and direct speech in text

    Negative Logits
     nod          -0.69
     lair         -0.63
     nodd         -0.62
     rundown      -0.61
     fray         -0.61
     salute       -0.60
     Versus       -0.60
     alias        -0.58
     playbook     -0.57
    lled          -0.57
    Positive Logits
    there         1.27
    nob           1.15
    they          1.06
    someone       1.05
    everyone      1.02
    these         1.01
    many          1.01
    when          0.94
    every         0.93
    despite       0.91
    Activation Density: 0.217%

    No Known Activations