INDEX
Explanations
phrases that express strong opinions or evaluations, often using words like "hard," "better," "good," "open," "well," "sick," "welcome," "prepared," and "advised."
phrases indicating difficulty or challenges in achieving something
New Auto-Interp
Negative Logits
innocuous
-0.67
deform
-0.65
succession
-0.64
ulence
-0.62
separat
-0.62
combustion
-0.61
intimacy
-0.61
geries
-0.60
unrem
-0.60
Combine
-0.59
POSITIVE LOGITS
aware
1.09
pleased
1.09
cerned
1.09
interested
1.02
willing
1.01
aware
0.98
obliged
0.98
convinced
0.97
interested
0.96
delighted
0.96
Activations Density 0.469%