INDEX
Explanations
phrases related to personal reflection and self-improvement
expressions of personal reflection and inquiry
Negative Logits
Token       Logit
ãĥĭ         -0.77
urst        -0.75
onde        -0.71
mere        -0.69
Reviewed    -0.69
³³³³³³³³    -0.67
Blog        -0.66
Compan      -0.65
ãĤ¤         -0.63
ouched      -0.63
Positive Logits
Token       Logit
â̦"         0.90
..."        0.80
gonna       0.79
}"          0.76
"]          0.74
]."         0.72
."[         0.72
.""         0.71
fuckin      0.71
mentality   0.70
Activations Density 1.897%
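
For context on the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the fraction of token positions where the feature's activation is above zero; the source does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and used only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of token positions on which the
    feature fires at all. An empty input yields 0.0.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy activations, not taken from the dashboard:
# 3 of the 8 positions are above zero.
sample = [0.0, 0.0, 1.2, 0.0, 0.4, 0.0, 0.0, 2.1]
print(f"{activation_density(sample):.3%}")  # prints 37.500%
```

Under this reading, a density of 1.897% would mean the feature is active on roughly 19 of every 1,000 sampled tokens.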