INDEX
    Explanations

    direct speech marked with quotations

    dialogue punctuation, particularly the use of quotation marks

    New Auto-Interp
    Negative Logits
    avorite
    -0.89
    irtual
    -0.71
    ¥ŀ
    -0.69
     vide
    -0.68
     Lauder
    -0.67
     deterrent
    -0.65
     satell
    -0.65
    phthal
    -0.64
     triv
    -0.64
     subsid
    -0.62
    POSITIVE LOGITS
    Oh
    1.01
    Hey
    0.99
    hey
    0.94
    Sir
    0.88
    Yo
    0.85
    I
    0.84
    cause
    0.83
    Everybody
    0.83
    Wait
    0.81
    Let
    0.80
    Act Density 0.087%

    No Known Activations