INDEX
    Explanations

    phrases indicating personal beliefs, statements, or actions attributed to specific individuals

    repeated mentions of the subject "he."

    New Auto-Interp
    Negative Logits
     Dome
    -0.66
    odder
    -0.66
     Gad
    -0.63
    cial
    -0.60
     Fashion
    -0.59
     Mandatory
    -0.59
     Plaint
    -0.59
     Veil
    -0.58
    iac
    -0.58
     Houses
    -0.57
    POSITIVE LOGITS
    'd
    1.08
    'll
    0.91
     personally
    0.88
     encount
    0.87
    've
    0.83
     regretted
    0.80
     regrets
    0.78
     unres
    0.77
    aps
    0.77
     awoke
    0.75
    Act Density 0.161%

    No Known Activations