INDEX
    Explanations

    phrases related to quotation marks and dialogue attribution

    statements that convey a sense of uncertainty or reflection on past events

    New Auto-Interp
    Negative Logits
    cano
    -0.77
     censor
    -0.75
     whip
    -0.75
    arde
    -0.73
     regenerate
    -0.70
     endeav
    -0.69
     realised
    -0.69
     censored
    -0.68
    inous
    -0.68
     maths
    -0.67
    POSITIVE LOGITS
    Indeed
    1.14
    Contribut
    1.11
    Attempts
    1.06
    Newsletter
    1.00
    Researchers
    0.99
    Eventually
    0.97
    RELATED
    0.97
    Despite
    0.96
    Fortunately
    0.94
    Still
    0.94
    Act Density 0.189%

    No Known Activations