    Explanations

    instances of people acknowledging or confessing to certain actions or thoughts

    occurrences of the word "he" and variations related to admissions or acknowledgments

    Negative Logits
    Token             Logit
    "icles"           -0.75
    "Review"          -0.71
    "Liber"           -0.69
    "lake"            -0.68
    "Friends"         -0.68
    "cellaneous"      -0.66
    "Ec"              -0.65
    "clock"           -0.64
    "blogspot"        -0.64
    " Budd"           -0.63

    Positive Logits
    Token             Logit
    " mistakes"       0.97
    " regrets"        0.91
    " underestimated" 0.74
    " wrongdoing"     0.73
    " misled"         0.73
    " underest"       0.73
    " unintentional"  0.71
    " conflicted"     0.71
    " mishand"        0.70
    " imperfect"      0.70

    Activation Density: 0.132%

    No Known Activations