INDEX
    Explanations

    phrases related to actions, decisions, and consequences

    the presence of conjunctions, particularly "and," indicating connections or additions in the text

    Negative Logits
    tnc         -0.80
    ļéĨĴ        -0.77
    Interested  -0.75
    inent       -0.74
    incial      -0.71
    culus       -0.70
    ANC         -0.69
    Enlarge     -0.69
    cerning     -0.69
    rared       -0.68
    Positive Logits
     succeeded   1.12
     deserve     1.01
     rightly     0.94
     reap        0.93
     nobody      0.90
     rightfully  0.89
     waited      0.87
     luckily     0.87
     prevailed   0.87
     rewarded    0.86
    Activation Density: 0.221%

    No Known Activations