INDEX
Explanations
phrases related to improvement and performing well
evaluations of performance or quality, particularly related to improvement and doing well
New Auto-Interp
Negative Logits
jected
-0.72
urated
-0.69
Tru
-0.68
ãĤ¦ãĤ¹
-0.68
severed
-0.65
Personality
-0.64
Tav
-0.63
Cutter
-0.63
cracked
-0.63
gerald
-0.63
POSITIVE LOGITS
job
0.79
grunt
0.75
chores
0.72
entreprene
0.71
offline
0.70
homework
0.70
FUL
0.70
repay
0.67
injustice
0.66
enrich
0.66
Activations Density 0.066%