INDEX
    Explanations

    quoted speech or dialogue within the text

    New Auto-Interp
    Negative Logits
     adjud
    -0.87
     favor
    -0.76
     cram
    -0.76
     midterm
    -0.73
     cannabin
    -0.73
     resettlement
    -0.72
     distribut
    -0.72
     derby
    -0.71
     aggreg
    -0.71
     scheduled
    -0.70
    POSITIVE LOGITS
    We
    1.21
    I
    1.17
    Our
    1.10
    Dear
    1.06
    Hey
    1.06
    Absolutely
    1.06
    There
    1.05
    Everyone
    1.05
    It
    1.04
    Hello
    1.02
    Act Density 0.064%

    No Known Activations