INDEX
Explanations
instances of people acknowledging or confessing to certain actions or thoughts
occurrences of the word "he" and variations related to admissions or acknowledgments
New Auto-Interp
Negative Logits
icles
-0.75
Review
-0.71
Liber
-0.69
lake
-0.68
Friends
-0.68
cellaneous
-0.66
Ec
-0.65
clock
-0.64
blogspot
-0.64
Budd
-0.63
POSITIVE LOGITS
mistakes
0.97
regrets
0.91
underestimated
0.74
wrongdoing
0.73
misled
0.73
underest
0.73
unintentional
0.71
conflicted
0.71
mishand
0.70
imperfect
0.70
Activations Density 0.132%